JOURNAL ARTICLE

Audio-visual continuous speech recognition using a coupled hidden Markov model

Abstract

With the increase in the computational power of recent computers, audio-visual speech recognition (AVSR) has become an attractive research topic that can lead to a robust solution for speech recognition in noisy environments. In the audio-visual continuous speech recognition system presented in this paper, the audio and visual observation sequences are integrated using a coupled hidden Markov model (CHMM). The statistical properties of the CHMM can describe the asynchrony of the audio and visual features while preserving their natural correlation over time. The experimental results show that the current system, tested on the XM2VTS database, reduces the error rate of the audio-only speech recognition system by over 55% at an SNR of 0 dB.
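
To make the coupling idea concrete, the following is a minimal sketch of a two-chain CHMM likelihood computation, not the system described in the paper. It assumes discrete observation symbols and small hand-picked probability tables (the paper's actual observation models, state counts, and parameters are not given in this abstract), and every name in it (`chmm_likelihood`, `TRANS_A`, `EMIT_V`, and so on) is hypothetical. Each chain's next state depends on the previous states of both chains, which is what lets the audio and visual chains occupy different states at the same time step (asynchrony) while remaining statistically coupled.

```python
from itertools import product

# Toy parameters (assumed for illustration only): 2 audio states,
# 2 visual states, and 2 discrete observation symbols per stream.
PI_A = [0.6, 0.4]  # initial audio-state probabilities
PI_V = [0.5, 0.5]  # initial visual-state probabilities
# TRANS_A[i][j][k] = P(audio state k at t | audio i, visual j at t-1)
TRANS_A = [[[0.7, 0.3], [0.5, 0.5]], [[0.2, 0.8], [0.1, 0.9]]]
# TRANS_V[i][j][k] = P(visual state k at t | audio i, visual j at t-1)
TRANS_V = [[[0.6, 0.4], [0.3, 0.7]], [[0.5, 0.5], [0.2, 0.8]]]
# EMIT_A[i][o] = P(audio symbol o | audio state i)
EMIT_A = [[0.9, 0.1], [0.2, 0.8]]
# EMIT_V[j][o] = P(visual symbol o | visual state j)
EMIT_V = [[0.8, 0.2], [0.3, 0.7]]


def chmm_likelihood(obs_audio, obs_video):
    """Forward recursion over the joint (audio state, visual state) space."""
    states = list(product(range(len(PI_A)), range(len(PI_V))))
    # Initialisation: the two chains start independently.
    alpha = {
        (i, j): PI_A[i] * PI_V[j]
        * EMIT_A[i][obs_audio[0]] * EMIT_V[j][obs_video[0]]
        for i, j in states
    }
    # Induction: each chain's transition is conditioned on BOTH previous states.
    for oa, ov in zip(obs_audio[1:], obs_video[1:]):
        alpha = {
            (k, l): EMIT_A[k][oa] * EMIT_V[l][ov]
            * sum(alpha[i, j] * TRANS_A[i][j][k] * TRANS_V[i][j][l]
                  for i, j in states)
            for k, l in states
        }
    # Termination: marginalise over the final joint state.
    return sum(alpha.values())


# Joint likelihood of one short audio/visual symbol sequence pair.
likelihood = chmm_likelihood([0, 1, 1], [0, 0, 1])
print(likelihood)
```

The joint state space grows as the product of the per-chain state counts, so this brute-force recursion is only practical for small models; it is meant to show the structure of the coupling rather than an efficient decoder.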

Keywords:
Hidden Markov model, Speech recognition, Computer science, Audio visual, Audio mining, Markov model, Artificial intelligence, Speaker recognition, Word error rate, Pattern recognition (psychology), Acoustic model, Speech processing, Markov chain, Machine learning, Multimedia

Metrics

Cited By: 42
FWCI (Field Weighted Citation Impact): 3.14
Refs: 10
Citation Normalized Percentile: 0.92

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Advanced Data Compression Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
