JOURNAL ARTICLE

Speaker Verification Based on Different Vector Quantization Techniques with Gaussian Mixture Models

Abstract

The introduction of Gaussian Mixture Models (GMMs) in the field of speaker verification has led to very good results. This paper illustrates an evolution in state-of-the-art Speaker Verification by highlighting the contribution of recently established information theoretic based vector quantization technique. We explore the novel application of three different vector quantization algorithms, namely K-means, Linde-Buzo-Gray (LBG) and Information Theoretic Vector Quantization (ITVQ) for efficient speaker verification. The Expectation Maximization (EM) algorithm used by GMM requires a prohibitive amount of iterations to converge. In this paper, comparable alternatives to EM including K-means, LBG and ITVQ algorithm were tested. The GMM-ITVQ algorithm was found to be the most efficient alternative for the GMM-EM. It gives correct classification rates at a similar level to that of GMM-EM. Finally, representative performance benchmarks and system behaviour experiments on NIST SRE corpora are presented.

Keywords:
Vector quantization Mixture model Speaker verification Computer science Quantization (signal processing) NIST Gaussian Expectation–maximization algorithm Linde–Buzo–Gray algorithm Algorithm Speaker recognition Pattern recognition (psychology) Maximization Artificial intelligence Speech recognition Mathematics Maximum likelihood Mathematical optimization Statistics

Metrics

20
Cited By
2.67
FWCI (Field Weighted Citation Impact)
20
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Bayesian Methods and Mixture Models
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.