JOURNAL ARTICLE

Robust speech recognition using articulatory gestures in a Dynamic Bayesian Network framework

Abstract

Articulatory Phonology models speech as spatio-temporal constellation of constricting events (e.g. raising tongue tip, narrowing lips etc.), known as articulatory gestures. These gestures are associated with distinct organs (lips, tongue tip, tongue body, velum and glottis) along the vocal tract. In this paper we present a Dynamic Bayesian Network based speech recognition architecture that models the articulatory gestures as hidden variables and uses them for speech recognition. Using the proposed architecture we performed: (a) word recognition experiments on the noisy data of Aurora-2 and (b) phone recognition experiments on the University of Wisconsin X-ray microbeam database. Our results indicate that the use of gestural information helps to improve the performance of the recognition system compared to the system using acoustic information only.

Keywords:
Gesture Speech recognition Computer science Vocal tract Artificial intelligence

Metrics

9
Cited By
1.18
FWCI (Field Weighted Citation Impact)
33
Refs
0.83
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Robust speech recognition with articulatory features using dynamic Bayesian networks

Vikramjit MitraHosung NamCarol Espy-WilsonElliot SaltzmanLouis Goldstein

Journal:   The Journal of the Acoustical Society of America Year: 2011 Vol: 130 (4_Supplement)Pages: 2408-2408
JOURNAL ARTICLE

Recognizing articulatory gestures from speech for robust speech recognition

Vikramjit MitraHosung NamCarol Espy-WilsonElliot SaltzmanLouis Goldstein

Journal:   The Journal of the Acoustical Society of America Year: 2012 Vol: 131 (3)Pages: 2270-2287
JOURNAL ARTICLE

Dynamic Bayesian Network Inversion for Robust Speech Recognition

Li XieHailong Yang

Journal:   IEICE Transactions on Information and Systems Year: 2007 Vol: E90-D (7)Pages: 1117-1120
JOURNAL ARTICLE

Robust modeling and recognition of hand gestures with dynamic Bayesian network

Heung‐Il SukBong-Kee SinSeong‐Whan Lee

Journal:   Proceedings - International Conference on Pattern Recognition/Proceedings/International Conference on Pattern Recognition Year: 2008 Vol: 11 Pages: 1-4
© 2026 ScienceGate Book Chapters — All rights reserved.