JOURNAL ARTICLE

Overcoming asynchrony in Audio-Visual Speech Recognition

Abstract

In this paper we propose two alternatives to overcome the natural asynchrony of modalities in Audio-Visual Speech Recognition. We first investigate the use of asynchronous statistical models based on Dynamic Bayesian Networks with different levels of asynchrony. We show that audio-visual models should consider asynchrony within word boundaries and not at phoneme level. The second approach to the problem includes an additional processing of the features before being used for recognition. The proposed technique aligns the temporal evolution of the audio and video streams in terms of a speech-recognition system and enables the use of simpler statistical models for classification. On both cases we report experiments with the CUAVE database, showing the improvements obtained with the proposed asynchronous model and feature processing technique compared to traditional systems.

Keywords:
Asynchrony (computer programming) Computer science Speech recognition Asynchronous communication Feature (linguistics) Audio mining Artificial intelligence Modalities Hidden Markov model Speech processing Pattern recognition (psychology) Machine learning Acoustic model

Metrics

2
Cited By
0.00
FWCI (Field Weighted Citation Impact)
21
Refs
0.17
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Audio-visual speech experience with age influences perceived audio-visual asynchrony in speech

Magnus AlmDawn M. Behne

Journal:   The Journal of the Acoustical Society of America Year: 2013 Vol: 134 (4)Pages: 3001-3010
JOURNAL ARTICLE

Audio visual speech recognition

Robert L. Beadles

Journal:   The Journal of the Acoustical Society of America Year: 1990 Vol: 87 (5)Pages: 2274-2274
© 2026 ScienceGate Book Chapters — All rights reserved.