Vector quantization techniques for GMM based speaker verification

Gurkirat Singh; Amlana Panda; Suvanjan Bhattacharyya; Thambipillai Srikanthan

doi:10.1109/icassp.2003.1202295

ScienceGate Book Chapters

JOURNAL ARTICLE

Vector quantization techniques for GMM based speaker verification

Gurkirat Singh Amlana Panda Suvanjan Bhattacharyya Thambipillai Srikanthan

Year: 2003 Journal: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). Vol: 2 Pages: II-65

DOI: 10.1109/icassp.2003.1202295

Get Full-Text PDF Get Analytical Report

Abstract

This paper explores the novel application of two vector quantization algorithms, namely Linde, Buzo, Gray (1980) and K-means algorithm for efficient speaker verification. Automatic speaker verification (ASV) is a memory and compute intensive process, giving rise to area and latency concerns in the way of its implementation for real-time efficient embedded systems. The training schemes for computing the speaker models, such as the expectation maximization are highly iterative and contribute significantly to the overall complexity in the implementation of the system. We demonstrate the use of the LBG and the K-means algorithm to realize compute efficient training method. Models trained with the LBG algorithm achieves as much as 99.88% of EM accuracy, whilst K-means achieves as much as 99.91% of EM accuracy. Moreover, the EM computational complexity is almost twice that of LBG or K-means. Thus, using LBG and K-means algorithms for training Gaussian mixture speaker models for text-independent speaker verification, we show that, that they deliver comparable performance as the EM algorithm at significantly reduced computational complexity. Thus making them an ideal choice for low-cost applications.

Keywords:

Computer science Vector quantization Computational complexity theory Speaker verification Quantization (signal processing) Algorithm Speaker recognition Mixture model Maximization Gaussian process Artificial intelligence Gaussian Speech recognition Mathematical optimization Mathematics

Metrics

Cited By

1.68

FWCI (Field Weighted Citation Impact)

Refs

0.89

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Vector quantization techniques for GMM based speaker verification

Abstract

Metrics

Citation History

Topics

Related Documents

Speaker Verification Based on Different Vector Quantization Techniques with Gaussian Mixture Models

Speaker Verification Based on Information Theoretic Vector Quantization

Vector quantization based Gaussian modeling for speaker verification

Speaker verification with discriminative vector quantization

Vector Quantization Mappings for Speaker Verification