JOURNAL ARTICLE

Histogram equalization and noise masking for robust speech recognition

Abstract

Mismatch between training and test conditions degrades the performance of speech recognizers. This paper investigates the combination of parametric histogram equalization (pHEQ) and noise masking to compensate for the mismatch caused by additive noise. The proposed front-end maps the distribution of the observed power spectrum vectors to a target distribution. The target distribution matches the distribution of the noise-free training data, except that its signal-to-noise ratio is artificially reduced. Different power spectrum estimation algorithms are used so that the noise distribution used internally by pHEQ can be estimated more reliably under nonstationary noise conditions. The proposed front-end is evaluated on the Aurora4 database and shows a significant improvement relative to mean-normalized Mel-frequency spectral coefficients. Moreover, the performance could be improved further if better estimates of the instantaneous noise power spectrum were available.
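
As a rough illustration of the distribution mapping described in the abstract, the sketch below implements generic, non-parametric histogram equalization (empirical quantile matching) for a single spectral band. It is not the parametric pHEQ method of the paper and it does not include noise masking; the function name `equalize_to_target` and the sample values are hypothetical and chosen only for illustration.

```python
import numpy as np


def equalize_to_target(observed, target):
    """Map each observed value to the target value at the same empirical quantile.

    Generic quantile matching for one band; an illustrative assumption,
    not the parametric pHEQ formulation from the paper.
    """
    observed = np.asarray(observed, dtype=float)
    target_sorted = np.sort(np.asarray(target, dtype=float))

    # Rank of each observed value within the observed data (0 = smallest).
    ranks = np.argsort(np.argsort(observed))
    # Mid-point empirical CDF values, strictly inside (0, 1).
    quantiles = (ranks + 0.5) / observed.size

    # Approximate the inverse target CDF by interpolating the sorted target samples.
    target_quantiles = (np.arange(target_sorted.size) + 0.5) / target_sorted.size
    return np.interp(quantiles, target_quantiles, target_sorted)


# Hypothetical log-power values for one band: noisy test data and clean training data.
noisy_band = [3.0, 1.0, 2.0]
clean_band = [10.0, 20.0, 30.0]
mapped = equalize_to_target(noisy_band, clean_band)
```

The mapping is monotonic, so the ordering of the observed values is preserved while their distribution is moved onto that of the target samples. In a front-end along the lines described above, the target samples would be replaced by the training distribution with an artificially reduced signal-to-noise ratio.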

Keywords:
Noise (video); Noise power; Speech recognition; Computer science; Noise measurement; Histogram; Masking (illustration); Value noise; Speech enhancement; Parametric statistics; Spectral density; Pattern recognition (psychology); Mathematics; Noise floor; Artificial intelligence; Power (physics); Noise reduction; Statistics; Telecommunications; Physics

Metrics

Cited By: 2
FWCI (Field Weighted Citation Impact): 0.00
Refs: 9
Citation Normalized Percentile: 0.18

Topics

Speech and Audio Processing
Physical Sciences → Computer Science → Signal Processing
Music and Audio Processing
Physical Sciences → Computer Science → Signal Processing
Speech Recognition and Synthesis
Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Quantile based histogram equalization for noise robust speech recognition

Hilger, Florian Erich

Journal: RWTH Publications (RWTH Aachen), Year: 2004

JOURNAL ARTICLE

A time-synchronous histogram equalization for noise robust speech recognition

Fumiya Takahashi, Masaharu Kato, Tetsuo Kosaka

Journal: The Journal of the Acoustical Society of America, Year: 2013, Vol: 133 (5_Supplement), Pages: 3247

JOURNAL ARTICLE

Histogram equalization with Bayesian estimation for noise robust speech recognition

Young-Joo Suh, Hoirin Kim

Journal: The Journal of the Acoustical Society of America, Year: 2018, Vol: 143 (2), Pages: 677-685