JOURNAL ARTICLE

Noise Classification Speech Enhancement Generative Adversarial Network

Tao FengYe LiPeng ZhangShu LiFuqiang Wang

Year: 2022 Journal:   2022 IEEE 6th Information Technology and Mechatronics Engineering Conference (ITOEC) Pages: 11-16

Abstract

The purpose of speech enhancement is to extract the speech signal from various noise backgrounds, improving the quality of the speech signal. After the emergence of the Speech Enhancement Generative Adversarial (SEGAN), it has achieved good results in the field of speech enhancement. However, SEGAN does not have an excellent speech enhancement effect in the case of low signal-to-noise ratio, it has weak generalization ability in the face of unknown noise. In this paper, we propose a method of generative adversarial network speech enhancement using noise background classification. In this method, the inputs are noisy speeches, which have a variety of background noises. Mel Frequency Cepstral Coefficient (MFCC) features of noisy speeches are extracted, convolutional neural network is used to classify each noisy background, and the classified noisy speeches are labeled with the type of background noise. The labeled noisy speeches are sent to the speech enhancement model. There are several SEGANs in the speech enhancement model. Each SEGAN enhances noisy speeches with a particular of background noise. Under extremely low signal-to-noise ratio conditions and in the face of unknown noise, we evaluate this method in extensive experiments, using objective evaluation indicators to evaluate the effectiveness of the model. Compared with the SEGAN model under the condition of extremely low signal-to-noise ratio, the model in this paper can eliminate noise better, and each objective index has been improved. In the face of unknown background noise, objective evaluation index of NCSEGAN is better than SEGAN, which confirms the effectiveness of the method.

Keywords:
Speech enhancement Speech recognition Computer science Noise (video) Noise measurement Background noise Signal-to-noise ratio (imaging) Artificial intelligence Pattern recognition (psychology) Cepstrum Noise reduction Telecommunications

Metrics

3
Cited By
0.42
FWCI (Field Weighted Citation Impact)
0
Refs
0.47
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

VSEGAN: Visual Speech Enhancement Generative Adversarial Network

Xinmeng XuYang WangDongxiang XuYiyuan PengCong ZhangJie JiaBinbin Chen

Journal:   ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Year: 2022 Pages: 7308-7311
BOOK-CHAPTER

Speech Enhancement Using Generative Adversarial Network (GAN)

Mahmudul HuqRytis Maskeliūnas

Lecture notes in networks and systems Year: 2022 Pages: 273-282
JOURNAL ARTICLE

Speech Enhancement via Residual Dense Generative Adversarial Network

Lin ZhouQiuyue ZhongTianyi WangSiyuan LuHongmei Hu

Journal:   Computer Systems Science and Engineering Year: 2021 Vol: 38 (3)Pages: 279-289
JOURNAL ARTICLE

Improved Wasserstein conditional generative adversarial network speech enhancement

Shan QinTing Jiang

Journal:   EURASIP Journal on Wireless Communications and Networking Year: 2018 Vol: 2018 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.