JOURNAL ARTICLE

Information Theoretic Feature Extraction for Audio-Visual Speech Recognition

Mihai GurbanJean‐Philippe Thiran

Year: 2009 Journal:   IEEE Transactions on Signal Processing Vol: 57 (12)Pages: 4765-4776   Publisher: Institute of Electrical and Electronics Engineers

Abstract

The problem of feature selection has been thoroughly analyzed in the context of pattern classification, with the purpose of avoiding the curse of dimensionality. However, in the context of multimodal signal processing, this problem has been studied less. Our approach to feature extraction is based on information theory, with an application on multimodal classification, in particular audio–visual speech recognition. Contrary to previous work in information theoretic feature selection applied to multimodal signals, our proposed methods penalize features for their redundancy, achieving more compact feature sets and better performance. We propose two greedy selection algorithms, one that penalizes a proportion of feature redundancy, while the other uses conditional mutual information as an evaluation measure, for the selection of visual features for audio–visual speech recognition. Our features perform better than linear discriminant analysis, the most usual transform for dimensionality reduction in the field, across a wide range of dimensionality values and combined with audio at different quality levels.

Keywords:
Speech recognition Computer science Feature extraction Audio mining Artificial intelligence Pattern recognition (psychology) Audio visual Feature (linguistics) Information extraction Speech processing Information theory Speaker recognition Voice activity detection Mathematics

Metrics

66
Cited By
5.91
FWCI (Field Weighted Citation Impact)
69
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.