Densely Connected Neural Network with Dilated Convolutions for Real-Time Speech Enhancement in The Time Domain

Ashutosh Pandey; DeLiang Wang

doi:10.1109/icassp40776.2020.9054536

ScienceGate Book Chapters

JOURNAL ARTICLE

Densely Connected Neural Network with Dilated Convolutions for Real-Time Speech Enhancement in The Time Domain

Ashutosh Pandey DeLiang Wang

Year: 2020 Pages: 6629-6633

DOI: 10.1109/icassp40776.2020.9054536

Get Full-Text PDF Get Analytical Report

Abstract

In this work, we propose a fully convolutional neural network for real-time speech enhancement in the time domain. The proposed network is an encoder-decoder based architecture with skip connections. The layers in the encoder and the decoder are followed by densely connected blocks comprising of dilated and causal convolutions. The dilated convolutions help in context aggregation at different resolutions. The causal convolutions are used to avoid information flow from future frames, hence making the network suitable for real-time applications. We also propose to use sub-pixel convolutional layers in the decoder for upsampling. Further, the model is trained using a loss function with two components; a time-domain loss and a frequency-domain loss. The proposed loss function outperforms the time-domain loss. Experimental results show that the proposed model significantly outperforms other real-time state-of-the-art models in terms of objective intelligibility and quality scores.

Keywords:

Upsampling Computer science Time domain Encoder Convolutional neural network Context (archaeology) Speech enhancement Convolution (computer science) Algorithm Speech recognition Artificial neural network Artificial intelligence Computer vision Image (mathematics) Noise reduction

Metrics

136

Cited By

11.82

FWCI (Field Weighted Citation Impact)

Refs

0.99

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Advanced Adaptive Filtering Techniques

Physical Sciences → Engineering → Computational Mechanics

Indoor and Outdoor Localization Technologies

Physical Sciences → Engineering → Electrical and Electronic Engineering

Densely Connected Neural Network with Dilated Convolutions for Real-Time Speech Enhancement in The Time Domain

Abstract

Metrics

Citation History

Topics

Related Documents

Densely Connected Dilated Convolutions with Time-Frequency Attention for Speech Enhancement

Densely Connected Network with Time-frequency Dilated Convolution for Speech Enhancement

Real time speech enhancement using densely connected neural networks and Squeezed temporal convolutional modules

DCT based densely connected convolutional GRU for real-time speech enhancement

Dilated convolutions and Time-Frequency Attention for Speech Enhancement