JOURNAL ARTICLE

Information fusion techniques in Audio-Visual Speech Recognition

Abstract

It is well known that human perception of speech relies both on audio and visual information. However, the physiology of information fusion process in humans is still indefinite which attracts scientists' attention to information fusion process for audio-visual speech recognition. In this work, a novel tandem hybrid approach is introduced for an efficient audio-visual speech recognition system and the performance of the proposed technique is experimentally compared with the widely used Multiple Stream Hidden Markov Model (MSHMM) approach.

Keywords:
Computer science Hidden Markov model Speech recognition Audio visual Audio mining Process (computing) Perception Artificial intelligence Visualization Information fusion Sensor fusion Speech processing Pattern recognition (psychology) Voice activity detection Multimedia

Metrics

1
Cited By
0.35
FWCI (Field Weighted Citation Impact)
9
Refs
0.61
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Analysis of multimodal fusion techniques for audio-visual speech recognition

Denis IvankoIrina KipyatkovaAndrey RonzhinAlexey Karpov

Journal:   Scientific and technical journal of information technologies mechanics and optics Year: 2016 Pages: 387-401
JOURNAL ARTICLE

Bimodal fusion in audio-visual speech recognition

Xiaozheng ZhangR.M. MersereauM. Clements

Journal:   Proceedings - International Conference on Image Processing Year: 2003 Vol: 1 Pages: I-964
© 2026 ScienceGate Book Chapters — All rights reserved.