JOURNAL ARTICLE

Speech Codec Enhancement With Generative Adversarial Networks

Tao FengYe LiPeng ZhangShu LiFuqiang Wang

Year: 2021 Journal:   2021 IEEE 5th Information Technology,Networking,Electronic and Automation Control Conference (ITNEC) Pages: 387-391

Abstract

In order to solve the problem of voice quality degradation after the encoding and decoding of various encoders, we propose a processing method for the back-end of various encoders based on a generative countermeasure network. This method uses a generative adversarial network to learn the relational mapping of the speech time domain before and after encoding by the encoder, and conducts training through end-to-end training to restore the quality of the speech encoded by the narrowband encoder. We selected the encoders G.726, G.729 and G723.1 to form a training set with the original speech respectively for training, used a test set composed of speakers that did not appear in the training set for model evaluation to verify the feasibility of the model. It can be seen from the experimental results that the generative adversarial network model we explored improves the encoded speech quality.

Keywords:
Computer science Speech recognition Encoder Codec Encoding (memory) Set (abstract data type) Speech coding Decoding methods Test set PSQM Adversarial system Generative grammar Narrowband Artificial intelligence Speech processing Voice activity detection Algorithm Telecommunications

Metrics

1
Cited By
0.00
FWCI (Field Weighted Citation Impact)
27
Refs
0.08
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Digital Media Forensic Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.