JOURNAL ARTICLE

Using Gaussian mixture modeling in speech recognition

Abstract

The paper describes a speaker-independent isolated word recognition system which uses a well known technique, the combination of vector quantization with hidden Markov modeling. The conventional vector quantization algorithm is substituted by a statistical clustering algorithm, the expectation-maximization algorithm, in this system. Based on the investigation of the data space, the phonemes were manually extracted from the training data and were used to generate the Gaussians in a code book in which each code word is a Gaussian rather than a centroid vector of the data class. Word-based hidden Markov modeling was then performed. Two English isolated digits data bases were investigated and the 12 Mel-spaced filter bank coefficients employed as the input feature. Compared with the conventional discrete HMM, the present system obtained a significant improvement of recognition accuracy.< >

Keywords:
Computer science Speech recognition Mixture model Artificial intelligence Gaussian Pattern recognition (psychology) Physics

Metrics

17
Cited By
0.68
FWCI (Field Weighted Citation Impact)
11
Refs
0.72
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Data Compression Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Bayesian Methods and Mixture Models
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Variable parameter Gaussian mixture hidden Markov modeling for speech recognition

Xiaodong CuiYifan Gong

Journal:   2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). Year: 2003 Vol: 1 Pages: I-12
© 2026 ScienceGate Book Chapters — All rights reserved.