Text-Independent Speaker Verification with Adversarial Learning on Short Utterances

Kai Liu; Huan Zhou

doi:10.1109/icassp40776.2020.9054036

ScienceGate Book Chapters

JOURNAL ARTICLE

Text-Independent Speaker Verification with Adversarial Learning on Short Utterances

Kai Liu Huan Zhou

Year: 2020 Pages: 6569-6573

DOI: 10.1109/icassp40776.2020.9054036

Get Full-Text PDF Get Analytical Report

Abstract

A text-independent speaker verification system suffers severe performance degradation under short utterance condition. To address the problem, in this paper, we propose an adversarially learned embedding mapping model that directly maps a short embedding to an enhanced embedding with increased discriminability. In particular, a Wasserstein GAN with a bunch of loss criteria are investigated. These loss functions have distinct optimization objectives and some of them are less favoured for the speaker verification research area. Different from most prior studies, our main objective in this study is to investigate the effectiveness of those loss criteria by conducting numerous ablation studies. Experiments on Voxceleb dataset showed that some criteria are beneficial to the verification performance while some have trivial effects. Lastly, a Wasserstein GAN with chosen loss criteria, without finetuning, achieves meaningful advancements over the baseline, with 4% relative improvements on EER and 7% on minDCF in the challenging scenario of short 2second utterances.

Keywords:

Speaker verification Embedding Utterance Computer science Speech recognition Baseline (sea) Degradation (telecommunications) Adversarial system Artificial intelligence Speaker recognition

Metrics

Cited By

1.62

FWCI (Field Weighted Citation Impact)

Refs

0.86

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Text-Independent Speaker Verification with Adversarial Learning on Short Utterances

Abstract

Metrics

Citation History

Topics

Related Documents

Text-independent speaker recognition with short utterances

End-to-End Text-Independent Speaker Verification with Triplet Loss on Short Utterances

A deep learning approach for text-independent speaker recognition with short utterances

An approach to text-independent speaker recognition with short utterances

Robust features for text-independent speaker recognition with short utterances