JOURNAL ARTICLE

Text-Independent Speaker Verification with Adversarial Learning on Short Utterances

Abstract

Text-independent speaker verification systems suffer severe performance degradation under short-utterance conditions. To address this problem, we propose an adversarially learned embedding mapping model that directly maps a short-utterance embedding to an enhanced embedding with increased discriminability. In particular, we investigate a Wasserstein GAN trained with several loss criteria. These loss functions have distinct optimization objectives, and some of them are rarely used in speaker verification research. Unlike most prior studies, our main objective is to assess the effectiveness of each loss criterion through numerous ablation studies. Experiments on the VoxCeleb dataset show that some criteria benefit verification performance while others have negligible effect. Finally, a Wasserstein GAN with the chosen loss criteria, without fine-tuning, achieves meaningful gains over the baseline, with relative improvements of 4% in EER and 7% in minDCF in the challenging scenario of short 2-second utterances.
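The two metrics quoted above, EER and minDCF, and the notion of relative improvement can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function names, the threshold sweep, and the cost parameters (`p_target=0.01`, `c_miss=1`, `c_fa=1`) are assumptions chosen for illustration, and the abstract does not state which settings the paper used.

```python
def error_rates(target_scores, nontarget_scores, threshold):
    """Return (miss rate, false-alarm rate) when accepting scores >= threshold."""
    miss = sum(s < threshold for s in target_scores) / len(target_scores)
    fa = sum(s >= threshold for s in nontarget_scores) / len(nontarget_scores)
    return miss, fa


def candidate_thresholds(target_scores, nontarget_scores):
    # Every observed score, plus +inf so that "reject everything" is included.
    return sorted(set(target_scores) | set(nontarget_scores)) + [float("inf")]


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate EER: the point where miss and false-alarm rates are closest."""
    best_gap, eer = float("inf"), 1.0
    for t in candidate_thresholds(target_scores, nontarget_scores):
        miss, fa = error_rates(target_scores, nontarget_scores, t)
        if abs(miss - fa) < best_gap:
            best_gap, eer = abs(miss - fa), (miss + fa) / 2
    return eer


def min_dcf(target_scores, nontarget_scores, p_target=0.01, c_miss=1.0, c_fa=1.0):
    """Normalized minimum detection cost (cost parameters are assumed values)."""
    # Cost of the best trivial system (always accept or always reject).
    default_cost = min(c_miss * p_target, c_fa * (1.0 - p_target))
    costs = []
    for t in candidate_thresholds(target_scores, nontarget_scores):
        miss, fa = error_rates(target_scores, nontarget_scores, t)
        costs.append(c_miss * miss * p_target + c_fa * fa * (1.0 - p_target))
    return min(costs) / default_cost


def relative_improvement(baseline, system):
    """Fractional reduction of an error metric relative to the baseline."""
    return (baseline - system) / baseline
```

Under this definition, a "4% relative improvement on EER" means the enhanced system's EER is 0.96 times the baseline EER, not that the EER dropped by four absolute percentage points.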

Keywords:
Speaker verification; Embedding; Utterance; Computer science; Speech recognition; Baseline (sea); Degradation (telecommunications); Adversarial system; Artificial intelligence; Speaker recognition

Metrics

Cited By: 14
FWCI (Field Weighted Citation Impact): 1.62
Refs: 26
Citation Normalized Percentile: 0.86

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Text-independent speaker recognition with short utterances

Kairui Li, Edwin H. Wrench

Journal: The Journal of the Acoustical Society of America Year: 1982 Vol: 72 (S1) Pages: S29-S30

JOURNAL ARTICLE

A deep learning approach for text-independent speaker recognition with short utterances

Rania Chakroun, Mondher Frikha

Journal: Multimedia Tools and Applications Year: 2023 Vol: 82 (21) Pages: 33111-33133

JOURNAL ARTICLE

Robust features for text-independent speaker recognition with short utterances

Rania Chakroun, Mondher Frikha

Journal: Neural Computing and Applications Year: 2020 Vol: 32 (17) Pages: 13863-13883