JOURNAL ARTICLE

Speech enhancement based on joint time-frequency segmentation

Abstract

We present an algorithm that decomposes speech into transient and non-transient components. The algorithm, called joint time-frequency segmentation, takes the wavelet packet coefficients of the speech signal and represents them as tiles of a time-frequency representation adapted to the characteristics of the signal itself. A wavelet packet coefficient whose tile height is greater than or equal to its tile width is classified as a transient coefficient; otherwise it is classified as a non-transient coefficient. The transient component is selectively amplified and recombined with the original speech, and the energy of the modified speech is adjusted to equal that of the original speech. Psychoacoustic tests with fourteen human listeners show that this modification significantly improves speech intelligibility in background noise, with gains ranging from 10% absolute at 0 dB to 31% absolute at -30 dB.
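The classification rule and the energy-matched recombination described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The `Tile` structure, the function names, and the gain value are hypothetical, the tile width and height are assumed to be expressed in comparable normalized units, and the wavelet packet transform and its inverse are omitted.

```python
import math
from dataclasses import dataclass


@dataclass
class Tile:
    """One wavelet packet coefficient with its time-frequency tile (hypothetical structure)."""
    coeff: float
    width: float   # time extent of the tile, normalized units (assumption)
    height: float  # frequency extent of the tile, normalized units (assumption)


def is_transient(tile):
    # Rule from the abstract: height >= width means a transient coefficient.
    return tile.height >= tile.width


def split_coefficients(tiles):
    """Return (transient, non_transient) coefficient lists, zero-filled where a tile is excluded."""
    transient = [t.coeff if is_transient(t) else 0.0 for t in tiles]
    non_transient = [0.0 if is_transient(t) else t.coeff for t in tiles]
    return transient, non_transient


def energy(samples):
    return sum(v * v for v in samples)


def enhance(original, transient, gain):
    """Add the amplified time-domain transient component to the original signal,
    then rescale so the result has the same energy as the original."""
    mixed = [o + gain * t for o, t in zip(original, transient)]
    mixed_energy = energy(mixed)
    if mixed_energy == 0.0:
        return mixed  # nothing to rescale
    scale = math.sqrt(energy(original) / mixed_energy)
    return [scale * m for m in mixed]


# Toy data: three tiles and a three-sample "signal" (illustrative values only).
tiles = [Tile(1.0, width=1, height=4), Tile(2.0, width=4, height=1), Tile(3.0, width=2, height=2)]
transient_coeffs, non_transient_coeffs = split_coefficients(tiles)
modified = enhance([1.0, 2.0, 2.0], [1.0, 0.0, 0.0], gain=2.0)
```

In a full pipeline the transient coefficients would be passed through an inverse wavelet packet transform before `enhance` is called; here the time-domain transient component is supplied directly to keep the sketch self-contained.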

Keywords:
Speech recognition; Wavelet packet decomposition; Computer science; Segmentation; Wavelet; Energy (signal processing); Speech enhancement; Intelligibility (philosophy); Transient (computer programming); Wavelet transform; Algorithm; Mathematics; Artificial intelligence; Noise reduction; Statistics

Metrics

Cited By: 3
FWCI (Field Weighted Citation Impact): 0.70
Refs: 11
Citation Normalized Percentile: 0.73

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Hearing Loss and Rehabilitation
Life Sciences →  Neuroscience →  Cognitive Neuroscience
Advanced Adaptive Filtering Techniques
Physical Sciences →  Engineering →  Computational Mechanics

Related Documents

JOURNAL ARTICLE

Joint Time–Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement

Charturong Tantibundhit, Franz Pernkopf, Gernot Kubin

Journal:   IEEE Transactions on Audio, Speech, and Language Processing Year: 2009 Vol: 18 (6) Pages: 1417-1428
JOURNAL ARTICLE

Speech preprocessing and enhancement based on joint time domain and time-frequency domain analysis

Wenbo Zhang, Xuefeng Xie, Yanling Du, Dongmei Huang

Journal:   The Journal of the Acoustical Society of America Year: 2024 Vol: 155 (6) Pages: 3580-3588
JOURNAL ARTICLE

Wavelet-Based Speech Enhancement Using Time-Frequency Adaptation

Kun-Ching Wang

Journal:   EURASIP Journal on Advances in Signal Processing Year: 2003 Vol: 2009 (1)
JOURNAL ARTICLE

Speech Enhancement Based on Time-Frequency Domain GAN

YIN Wen-bing, GAO Ge, ZENG Bang, WANG Xiao, CHEN Yi

Journal:   DOAJ (DOAJ: Directory of Open Access Journals) Year: 2022