Boundary Discriminative Large Margin Cosine Loss for Text-independent Speaker Verification

Rongjin Li; Na Li; Deyi Tuo; Meng Yu; Dan Su; Dong Yu

doi:10.1109/icassp.2019.8682749

ScienceGate Book Chapters

JOURNAL ARTICLE

Boundary Discriminative Large Margin Cosine Loss for Text-independent Speaker Verification

Rongjin Li Na Li Deyi Tuo Meng Yu Dan Su Dong Yu

Year: 2019 Pages: 6321-6325

DOI: 10.1109/icassp.2019.8682749

Get Full-Text PDF Get Analytical Report

Abstract

Deep neural network based speaker embeddings have attracted much attention in text-independent speaker verification task. In addition to the network architecture, an appropriate design of the loss function is crucial for the deep discriminative embedding extractor. Inspired by the success of Large Margin Cosine Loss (LMCL) in face recognition, we propose an enhanced LMCL named boundary discriminative LMCL (BD-LMCL) to emphasize the discriminative information inherited in the speaker boundaries. Unlike LMCL, where all training samples contribute equally for the objective function, only the samples around the speaker boundaries are considered during the network training with BD-LMCL. Specifically, those samples close to the boundaries are dynamically selected using top-k zero-one loss. Experimental results on a short duration corpus Android Cellphone and NIST SRE 2012 demonstrate better performance compared to LMCL and other popular loss functions.

Keywords:

Discriminative model Speech recognition Computer science NIST Margin (machine learning) Pattern recognition (psychology) Speaker recognition Artificial intelligence Embedding Boundary (topology) Machine learning Mathematics

Metrics

Cited By

3.53

FWCI (Field Weighted Citation Impact)

Refs

0.94

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Boundary Discriminative Large Margin Cosine Loss for Text-independent Speaker Verification

Abstract

Metrics

Citation History

Topics

Related Documents

Speaker verification using large margin GMM discriminative training

Large Margin Softmax Loss for Speaker Verification

Maximum model distance discriminative training for text-independent speaker verification

Discriminative Transformation for Sufficient Adaptation in Text-Independent Speaker Verification

Masked Proxy Loss for Text-Independent Speaker Verification