JOURNAL ARTICLE

Relevant Feature Selection for Audio-Visual Speech Recognition

Abstract

We present a feature selection method based on information theoretic measures, targeted at multimodal signal processing, showing how we can quantitatively assess the relevance of features from different modalities. We are able to find the features with the highest amount of information relevant for the recognition task, and at the same having minimal redundancy. Our application is audio-visual speech recognition, and in particular selecting relevant visual features. Experimental results show that our method outperforms other feature selection algorithms from the literature by improving recognition accuracy even with a significantly reduced number of features.

Keywords:
Computer science Feature selection Redundancy (engineering) Speech recognition Pattern recognition (psychology) Feature (linguistics) Artificial intelligence Modalities Relevance (law) Selection (genetic algorithm) Feature extraction Task (project management) Visualization

Metrics

23
Cited By
1.86
FWCI (Field Weighted Citation Impact)
19
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Visual feature analysis for audio-visual speech recognition

Ivana Arsic

Journal:   Infoscience (Ecole Polytechnique Fédérale de Lausanne) Year: 2008
JOURNAL ARTICLE

Information Theoretic Feature Extraction for Audio-Visual Speech Recognition

Mihai GurbanJean‐Philippe Thiran

Journal:   IEEE Transactions on Signal Processing Year: 2009 Vol: 57 (12)Pages: 4765-4776
JOURNAL ARTICLE

Audio visual speech recognition

Robert L. Beadles

Journal:   The Journal of the Acoustical Society of America Year: 1990 Vol: 87 (5)Pages: 2274-2274
BOOK-CHAPTER

Speech Recognition, Audio-Visual

Gerasimos Potamianos

Elsevier eBooks Year: 2006 Pages: 800-805
© 2026 ScienceGate Book Chapters — All rights reserved.