Robust speech recognition using articulatory gestures in a Dynamic Bayesian Network framework

Vikramjit Mitra; Hosung Nam; Carol Espy-Wilson

doi:10.1109/asru.2011.6163918

ScienceGate Book Chapters

JOURNAL ARTICLE

Robust speech recognition using articulatory gestures in a Dynamic Bayesian Network framework

Vikramjit Mitra Hosung Nam Carol Espy-Wilson

Year: 2011 Vol: 18 Pages: 131-136

DOI: 10.1109/asru.2011.6163918

Get Full-Text PDF Get Analytical Report

Abstract

Articulatory Phonology models speech as spatio-temporal constellation of constricting events (e.g. raising tongue tip, narrowing lips etc.), known as articulatory gestures. These gestures are associated with distinct organs (lips, tongue tip, tongue body, velum and glottis) along the vocal tract. In this paper we present a Dynamic Bayesian Network based speech recognition architecture that models the articulatory gestures as hidden variables and uses them for speech recognition. Using the proposed architecture we performed: (a) word recognition experiments on the noisy data of Aurora-2 and (b) phone recognition experiments on the University of Wisconsin X-ray microbeam database. Our results indicate that the use of gestural information helps to improve the performance of the recognition system compared to the system using acoustic information only.

Keywords:

Gesture Speech recognition Computer science Vocal tract Artificial intelligence

Metrics

Cited By

1.18

FWCI (Field Weighted Citation Impact)

Refs

0.83

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Robust speech recognition using articulatory gestures in a Dynamic Bayesian Network framework

Abstract

Metrics

Citation History

Topics

Related Documents

Robust speech recognition with articulatory features using dynamic Bayesian networks

Recognizing articulatory gestures from speech for robust speech recognition

Dynamic Bayesian Network Inversion for Robust Speech Recognition

Robust word recognition using articulatory trajectories and gestures

Robust modeling and recognition of hand gestures with dynamic Bayesian network