Recognition of noisy speech

A. Erell; M. Weintraub

doi:10.3115/116580.116718

ScienceGate Book Chapters

JOURNAL ARTICLE

Recognition of noisy speech

A. Erell M. Weintraub

Year: 1990 Pages: 341-345

DOI: 10.3115/116580.116718

Get Full-Text PDF Get Analytical Report

Abstract

A model-based spectral estimation algorithm is derived that improves the robustness of speech recognition systems to additive noise. The algorithm is tailored for filter-bank-based systems, where the estimation should seek to minimize the distortion as measured by the recognizer's distance metric. This estimation criterion is approximated by minimizing the Euclidean distance between spectral log-energy vectors, which is equivalent to minimizing the nonweighted, nontruncated cepstral distance. Correlations between frequency channels are incorporated in the estimation by modeling the spectral distribution of speech as a mixture of components, each representing a different speech class, and assuming that spectral energies at different frequency channels are uncorrelated within each class. The algorithm was tested with SRI's continuous-speech, speaker-independent, hidden Markov model recognition system using the large-vocabulary NIST "Resource Management Task." When trained on a clean-speech database and tested with additive white Gaussian noise, the new algorithm has an error rate half of that with MMSE estimation of log spectral energies at individual frequency channels, and it achieves a level similar to that with the ideal condition of training and testing at constant SNR. The algorithm is also very efficient with additive environmental noise, recorded with a desktop microphone.

Keywords:

Speech recognition Computer science Hidden Markov model Robustness (evolution) Speech enhancement Mixture model Microphone Speaker recognition Word error rate Additive white Gaussian noise Speech processing Pattern recognition (psychology) Artificial intelligence White noise Noise reduction Sound pressure

Metrics

Cited By

1.54

FWCI (Field Weighted Citation Impact)

Refs

0.83

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Recognition of noisy speech

Abstract

Metrics

Citation History

Topics

Related Documents

Automatic Recognition of Noisy Speech

Speech recognition in noisy environments

Continuous Kannada Noisy Speech Recognition

Emotion Recognition from Noisy Speech

Noisy Speech Recognition Based on Speech Enhancement