Zezheng Xu, Ting Jiang, Chao Li, Jiacheng Yu
Convolutional neural networks (CNNs) have achieved remarkable results in speech enhancement. However, because convolution is a local operation, it struggles to capture the global context of the feature map. To address this problem, we propose an attention-augmented fully convolutional neural network for monaural speech enhancement. Specifically, we integrate a new two-dimensional relative self-attention mechanism into a fully convolutional network. In addition, we adopt the Huber loss as the loss function, which is more robust to outliers. Experimental results show that, compared with the optimally modified log-spectral amplitude (OM-LSA) estimator and other CNN-based models, the proposed network performs better on five metrics and achieves a good balance between noise suppression and speech distortion. Moreover, we embed the proposed attention mechanism into other convolutional networks and obtain satisfactory results, indicating that the mechanism generalizes well.
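The Huber loss mentioned above is quadratic for small errors and linear for large ones, which is what makes it more robust to outliers than the mean squared error. A minimal sketch (the threshold parameter `delta` is an assumption, as the abstract does not specify it):

```python
import numpy as np

def huber_loss(pred, target, delta=1.0):
    """Huber loss: quadratic when |error| <= delta, linear beyond it.

    Large residuals contribute linearly rather than quadratically,
    so outliers pull the loss less strongly than under MSE.
    """
    err = pred - target
    abs_err = np.abs(err)
    quadratic = 0.5 * err ** 2                  # MSE-like region
    linear = delta * (abs_err - 0.5 * delta)    # L1-like region
    return np.mean(np.where(abs_err <= delta, quadratic, linear))

# Small error (0.5 <= delta): quadratic branch, 0.5 * 0.5**2 = 0.125
print(huber_loss(np.array([0.5]), np.array([0.0])))
# Large error (3.0 > delta): linear branch, 1.0 * (3.0 - 0.5) = 2.5
print(huber_loss(np.array([3.0]), np.array([0.0])))
```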