Maximum model distance discriminative training for text-independent speaker verification

Qingyang Hong; Sam Kwong

doi:10.1109/iecon.2004.1431850

ScienceGate Book Chapters

JOURNAL ARTICLE

Maximum model distance discriminative training for text-independent speaker verification

Qingyang Hong Sam Kwong

Year: 2005 Vol: 2 Pages: 1769-1774

DOI: 10.1109/iecon.2004.1431850

Get Full-Text PDF Get Analytical Report

Abstract

This paper presents the design and implementation of text-independent speaker verification. We apply the maximum model distance (MMD) algorithm to the Gaussian mixture model (GMM) training. The traditional maximum likelihood (ML) method only utilizes the labeled utterances for each speaker model, which probably leads to a local optimization solution. By maximizing the model distance between the target and competing speakers, MMD could add the discriminative capability into the training procedure and then improve the verification performance. Based on the TIMIT corpus, we designed the verification experiments and the results show that the equal error rate (EER) could be reduced greatly compared with the traditional ML method.

Keywords:

Discriminative model TIMIT Computer science Mixture model Speaker verification Word error rate Speech recognition Maximum likelihood Artificial intelligence Pattern recognition (psychology) Gaussian Gaussian process Speaker recognition Hidden Markov model Mathematics Statistics

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.16

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Maximum model distance discriminative training for text-independent speaker verification

Abstract

Metrics

Topics

Related Documents

Discriminative training for speaker identification based on maximum model distance algorithm

Orthogonal Training for Text-Independent Speaker Verification

Discriminative Transformation for Sufficient Adaptation in Text-Independent Speaker Verification

Studies on Model Distance Normalization Approach in Text-independent Speaker Verification

A discriminative training approach for text-independent speaker recognition