Abstract

Automatic speech recognition (ASR) technology has become more accessible in the last decade. However, the performance of state of the art ASR systems is still far from being optimal in comparison to human performance. The robustness of common ASR systems is very limited. A possible improvement can be in the low-level acoustic-phonetic modeling. For example, improvement can be obtained by applying the recognition mechanism in parallel on nonoverlapping sub-bands. In order to show that such a mechanism can be beneficial, we have tested human ability to recognize speech embedded in a noisy background in non-overlapping sub-bands. The human performances were compared to a typical Hidden-Markov-Model (HMM) based ASR system (HTK). Consequently, we conclude that speech information exists in different and non-overlapping sub-bands with almost no significance to the central frequencies of the sub-bands. Using sub-band processes together with traditional processing, can obtain better automatic speech recognition.

Keywords:
Hidden Markov model Speech recognition Computer science Robustness (evolution) Speech processing Acoustic model Artificial intelligence Voice activity detection Speaker recognition Pattern recognition (psychology)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.06
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Multi-Channel Sub-Band Speech Recognition

Iain McCowanSridha Sridharan

Journal:   EURASIP Journal on Advances in Signal Processing Year: 2001 Vol: 2001 (1)
JOURNAL ARTICLE

Microphone array sub-band speech recognition

Iain McCowanS. Sridharan

Year: 2002 Vol: 1 Pages: 185-188
JOURNAL ARTICLE

Sub-band based recognition of noisy speech

Sangita TibrewalaHynek Heřmanský

Year: 2002 Vol: 2 Pages: 1255-1258
JOURNAL ARTICLE

Sub-band weighted projection measure for sub-band speech recognition in noise

Babak NasersharifAhmad Akbari

Journal:   Electronics Letters Year: 2006 Vol: 42 (14)Pages: 829-831
© 2026 ScienceGate Book Chapters — All rights reserved.