JOURNAL ARTICLE

Speech enhancement in joint time-frequency domain based on real-valued discrete gabor transform

Abstract

In this paper, we propose a new speech enhancement method in joint time-frequency domain. Noisy speech is first transformed into the joint time-frequency domain by fast Real-Valued Discrete Gabor Transform (RDGT) where the Gaussian window is used as the transform kernel due to its superior local energy assembling ability. The MMSE based log-amplitude estimator is derived under speech presence uncertainty hypothesis and also with the assumption that speech and noise are statistically independent Gaussian random variables. Clean speech estimate is then got by inverse transform of RDGT. Experimental results show that the proposed method is very effective in avoiding the musical residual noise and retaining weak speech components.

Keywords:
Gabor transform Speech enhancement Speech recognition Joint (building) Computer science Frequency domain Noise (video) Estimator Kernel (algebra) Time–frequency analysis Discrete Fourier transform (general) Gaussian noise Short-time Fourier transform Pattern recognition (psychology) Mathematics Algorithm Fourier transform Artificial intelligence Noise reduction Statistics Computer vision Fourier analysis Engineering Image (mathematics)

Metrics

2
Cited By
0.33
FWCI (Field Weighted Citation Impact)
15
Refs
0.54
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Advanced Adaptive Filtering Techniques
Physical Sciences →  Engineering →  Computational Mechanics
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.