JOURNAL ARTICLE

An Attention-augmented Fully Convolutional Neural Network for Monaural Speech Enhancement

Abstract

Convolutional neural networks (CNN) have made remarkable achievements in speech enhancement. However, the convolution operation is difficult to obtain the global context of the feature map due to its locality. To solve the above problem, we propose an attention-augmented fully convolutional neural network for monaural speech enhancement. More specifically, the method is to integrate a new two-dimensional relative selfattention mechanism into fully convolutional networks. Besides, we utilize Huber Loss as the loss function, which is more robust to noise. Experimental results indicate that compared with the optimally modified log-spectral amplitude (OMLSA) estimator and other CNN-based models, our proposed network has better performance in five indicators, and can well balance noise suppression and speech distortion. What is more, we also embed the proposed attention mechanism into other convolutional networks and get satisfactory results, showing that this mechanism has great generalization ability.

Keywords:
Computer science Convolutional neural network Speech enhancement Monaural Speech recognition Convolution (computer science) Context (archaeology) Estimator Artificial intelligence Noise (video) Generalization Pattern recognition (psychology) Artificial neural network Algorithm Noise reduction Image (mathematics) Mathematics

Metrics

5
Cited By
0.72
FWCI (Field Weighted Citation Impact)
28
Refs
0.69
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Advanced Adaptive Filtering Techniques
Physical Sciences →  Engineering →  Computational Mechanics
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.