Distributed speaker recognition using earth mover's distance

Yoshiyuki Umeda; Shingo Kuroiwa; Satoru Tsuge; Fuji Ren

doi:10.21437/interspeech.2004-538

Year: 2004 Pages: 2389-2392

Abstract

In this paper, we focus on distributed speaker recognition, a technique in which quantized feature parameters are sent to a server, as in distributed speech recognition. The Gaussian mixture model (GMM), the traditional method for speaker recognition, is trained using the maximum likelihood approach. Because the GMM's output probabilities are continuous density functions, it is difficult to fit to quantized data. To overcome this problem, we propose a novel speaker recognition technique that does not need speaker model training. The proposed method directly calculates the distance between the set of quantized feature parameters of the registered speech and the set of quantized feature parameters of the test speech. To measure this distance, we use the Earth Mover's Distance (EMD), which has recently been applied successfully to image retrieval. We conduct text-independent speaker identification experiments using the proposed method. Compared with the traditional GMM, the proposed method yielded a relative error reduction of 80% on quantized data.
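The idea of comparing two sets of quantized features directly with the EMD can be illustrated with a small sketch. This is not the authors' implementation. It assumes each utterance has already been reduced to a sequence of vector-quantization indices, that a signature is the histogram of those indices with the codebook centroids as positions, and that the ground distance is Euclidean. The names `signature`, `emd`, and `identify` are hypothetical, and the EMD follows the usual transportation-problem definition (total cost divided by total flow), solved here with a simple successive-shortest-path routine that is adequate only for small signatures.

```python
import math
from collections import Counter


def signature(codes, codebook):
    """Collapse a sequence of VQ indices into (centroid, count) pairs (assumed form)."""
    counts = Counter(codes)
    return [(codebook[c], n) for c, n in sorted(counts.items())]


def emd(sig_a, sig_b):
    """Earth mover's distance between two signatures with integer weights.

    Solves the transportation problem as a min-cost flow using successive
    shortest paths (Bellman-Ford on the residual graph).
    """
    n, m = len(sig_a), len(sig_b)
    size = n + m + 2
    src, snk = n + m, n + m + 1
    graph = [[] for _ in range(size)]  # node -> indices into edges
    edges = []  # [to, residual capacity, cost]; edge i ^ 1 is the reverse of i

    def add_edge(u, v, cap, cost):
        graph[u].append(len(edges))
        edges.append([v, cap, cost])
        graph[v].append(len(edges))
        edges.append([u, 0, -cost])

    supply = sum(w for _, w in sig_a)
    for i, (_, w) in enumerate(sig_a):
        add_edge(src, i, w, 0.0)
    for j, (_, w) in enumerate(sig_b):
        add_edge(n + j, snk, w, 0.0)
    for i, (p, _) in enumerate(sig_a):
        for j, (q, _) in enumerate(sig_b):
            add_edge(i, n + j, supply, math.dist(p, q))  # Euclidean ground distance

    total_flow, total_cost = 0, 0.0
    while True:
        dist = [math.inf] * size
        prev_edge = [-1] * size
        dist[src] = 0.0
        for _ in range(size - 1):  # Bellman-Ford relaxation passes
            changed = False
            for u in range(size):
                if dist[u] == math.inf:
                    continue
                for e in graph[u]:
                    v, cap, cost = edges[e]
                    if cap > 0 and dist[u] + cost < dist[v] - 1e-12:
                        dist[v] = dist[u] + cost
                        prev_edge[v] = e
                        changed = True
            if not changed:
                break
        if dist[snk] == math.inf:
            break  # no augmenting path left: the flow is maximal
        # Find the bottleneck capacity along the shortest path, then push flow.
        push, v = math.inf, snk
        while v != src:
            e = prev_edge[v]
            push = min(push, edges[e][1])
            v = edges[e ^ 1][0]
        v = snk
        while v != src:
            e = prev_edge[v]
            edges[e][1] -= push
            edges[e ^ 1][1] += push
            v = edges[e ^ 1][0]
        total_flow += push
        total_cost += push * dist[snk]
    return total_cost / total_flow if total_flow else 0.0


def identify(test_sig, registered):
    """Return the registered speaker whose signature is closest to the test signature."""
    return min(registered, key=lambda name: emd(registered[name], test_sig))


# Toy data: a 3-entry, 2-dimensional codebook and two enrolled speakers.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
registered = {
    "alice": signature([0, 0, 1], codebook),
    "bob": signature([2, 2, 2], codebook),
}
print(identify(signature([0, 1, 0, 0], codebook), registered))
```

No model is trained in this sketch: enrolling a speaker only stores a signature, and identification is a nearest-neighbour search under the EMD. With raw counts as weights, signatures of different total mass are matched only partially, so normalizing the weights may be preferable in practice.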

Keywords:

Earth mover's distance; Speaker recognition; Computer science; Mixture model; Speech recognition; Pattern recognition (psychology); Feature (linguistics); Feature extraction; Artificial intelligence; Set (abstract data type); Gaussian; Speaker diarisation; Focus (optics); Measure (data warehouse); Data mining

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Related Documents

Distributed speaker recognition using speaker-dependent VQ codebook and earth mover's distance

Nonparametric Speaker Recognition Method Using Earth Mover's Distance

Recognition of Abnormal Red Blood Cells Using Earth Mover's Distance Algorithm

Fast and Robust Speaker Clustering Using the Earth Mover's Distance and Mixmax Models

Business Forms Classification Using Earth Mover's Distance