JOURNAL ARTICLE

Vector quantization techniques for GMM based speaker verification

Gurkirat SinghAmlana PandaSuvanjan BhattacharyyaThambipillai Srikanthan

Year: 2003 Journal:   2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). Vol: 2 Pages: II-65

Abstract

This paper explores the novel application of two vector quantization algorithms, namely Linde, Buzo, Gray (1980) and K-means algorithm for efficient speaker verification. Automatic speaker verification (ASV) is a memory and compute intensive process, giving rise to area and latency concerns in the way of its implementation for real-time efficient embedded systems. The training schemes for computing the speaker models, such as the expectation maximization are highly iterative and contribute significantly to the overall complexity in the implementation of the system. We demonstrate the use of the LBG and the K-means algorithm to realize compute efficient training method. Models trained with the LBG algorithm achieves as much as 99.88% of EM accuracy, whilst K-means achieves as much as 99.91% of EM accuracy. Moreover, the EM computational complexity is almost twice that of LBG or K-means. Thus, using LBG and K-means algorithms for training Gaussian mixture speaker models for text-independent speaker verification, we show that, that they deliver comparable performance as the EM algorithm at significantly reduced computational complexity. Thus making them an ideal choice for low-cost applications.

Keywords:
Computer science Vector quantization Computational complexity theory Speaker verification Quantization (signal processing) Algorithm Speaker recognition Mixture model Maximization Gaussian process Artificial intelligence Gaussian Speech recognition Mathematical optimization Mathematics

Metrics

36
Cited By
1.68
FWCI (Field Weighted Citation Impact)
7
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.