JOURNAL ARTICLE

Deep Noise Suppression for Real Time Speech Enhancement in a Single Channel Wide Band Scenario

Gómez, Esteban

Year: 2021 Journal:   Zenodo (CERN European Organization for Nuclear Research)   Publisher: European Organization for Nuclear Research

Abstract

Speech enhancement can be regarded as a dual task that addresses two important issues of degraded speech: Speech quality and speech intelligibility. Improved speech quality can reduce listener’s fatigue, whereas improved speech intelligibility can re-duce the listener’s e˙ort to understand and extract meaning from speech. This work is focused on speech quality in a real time context. Algorithms that improve speech quality are sometimes referred to as noise suppression algorithms, since they enhance quality by suppressing the background noise of the degraded speech. Improving state of the art noise suppression algorithms could lead to significant benefits to several applications such as video conferencing systems, phone calls or speech recognition systems. Real time capable algorithms are especially important for devices with a limited processing power and physical constraints that cannot make use of large architectures, such as hearing aids or wearables. This work uses a deep learning based approach to expand on two previously proposed architectures in the context of the Deep Noise Suppression Challenge carried out by Microsoft. This challenge has provided datasets and resources to teams of researchers with the common goal of fostering the research on the aforementioned topic. The outcome of this thesis can be divided into three main contributions: First, an extended comparison between six variants of the two selected models, considering performance, computational com-plexity and real time eÿciency analyses. Secondly, making available an open source implementation of one of the proposed architectures as well as a framework transla-tion of an existing implementation. Finally, proposed variants that outperform the previously defined models in terms of denoising performance, complexity and real time eÿciency.

Keywords:
Speech enhancement Intelligibility (philosophy) Voice activity detection Speech processing Noise (video) Background noise Phone Task (project management)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.28
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Advanced Adaptive Filtering Techniques
Physical Sciences →  Engineering →  Computational Mechanics
Hearing Loss and Rehabilitation
Life Sciences →  Neuroscience →  Cognitive Neuroscience

Related Documents

DISSERTATION

Deep Noise Suppression for Real Time Speech Enhancement in a Single Channel Wide Band Scenario

Esteban Moreno Gómez

University:   Zenodo (CERN European Organization for Nuclear Research) Year: 2021
JOURNAL ARTICLE

Real-time speech enhancement algorithm for transient noise suppression

Ruiyu LiangYue XieJiaming ChengGuichen TangShinuo Sun

Journal:   Multimedia Tools and Applications Year: 2020 Vol: 80 (3)Pages: 3681-3702
JOURNAL ARTICLE

A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement.

Sir.Ai

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2021
© 2026 ScienceGate Book Chapters — All rights reserved.