Robust word recognition using articulatory trajectories and gestures

Vikramjit Mitra; Hosung Nam; Carol Espy-Wilson; Elliot Saltzman; Louis Goldstein

doi:10.21437/interspeech.2010-576

ScienceGate Book Chapters

JOURNAL ARTICLE

Robust word recognition using articulatory trajectories and gestures

Vikramjit Mitra Hosung Nam Carol Espy-Wilson Elliot Saltzman Louis Goldstein

Year: 2010 Pages: 2038-2041

DOI: 10.21437/interspeech.2010-576

Get Full-Text PDF Get Analytical Report

Abstract

Articulatory Phonology views speech as an ensemble of constricting events (e.g. narrowing lips, raising tongue tip), gestures, at distinct organs (lips, tongue tip, tongue body, velum, and glottis) along the vocal tract. This study shows that articulatory information in the form of gestures and their output trajectories (tract variable time functions or TVs) can help to improve the performance of automatic speech recognition systems. The lack of any natural speech database containing such articulatory information prompted us to use a synthetic speech dataset (obtained from Haskins Laboratories TAsk Dynamic model of speech production) that contains acoustic waveform for a given utterance and its corresponding gestures and TVs. First, we propose neural network based models to recognize the gestures and estimate the TVs from acoustic information. Second, the “synthetic-data trained” articulatory models were applied to the natural speech utterances in Aurora-2 corpus to estimate their gestures and TVs. Finally, we show that the estimated articulatory information helps to improve the noise robustness of a word recognition system when used along with the cepstral

Keywords:

Gesture Speech recognition Computer science Vocal tract Speech production Utterance Robustness (evolution) Artificial intelligence

Metrics

Cited By

2.81

FWCI (Field Weighted Citation Impact)

Refs

0.91

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Phonetics and Phonology Research

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Robust word recognition using articulatory trajectories and gestures

Abstract

Metrics

Citation History

Topics

Related Documents

Recognizing articulatory gestures from speech for robust speech recognition

Robust speech recognition using articulatory gestures in a Dynamic Bayesian Network framework

Articulatory gestures are insensitive to within-word context

Articulatory trajectories for large-vocabulary speech recognition

Coordination of Word Onset Articulatory Gestures in Swedish: Anticipatory Cues to Word Accents