Speech Codec Enhancement With Generative Adversarial Networks

Tao Feng; Ye Li; Peng Zhang; Shu Li; Fuqiang Wang

doi:10.1109/itnec52019.2021.9586878

ScienceGate Book Chapters

JOURNAL ARTICLE

Speech Codec Enhancement With Generative Adversarial Networks

Tao Feng Ye Li Peng Zhang Shu Li Fuqiang Wang

Year: 2021 Journal: 2021 IEEE 5th Information Technology,Networking,Electronic and Automation Control Conference (ITNEC) Pages: 387-391

DOI: 10.1109/itnec52019.2021.9586878

Get Full-Text PDF Get Analytical Report

Abstract

In order to solve the problem of voice quality degradation after the encoding and decoding of various encoders, we propose a processing method for the back-end of various encoders based on a generative countermeasure network. This method uses a generative adversarial network to learn the relational mapping of the speech time domain before and after encoding by the encoder, and conducts training through end-to-end training to restore the quality of the speech encoded by the narrowband encoder. We selected the encoders G.726, G.729 and G723.1 to form a training set with the original speech respectively for training, used a test set composed of speakers that did not appear in the training set for model evaluation to verify the feasibility of the model. It can be seen from the experimental results that the generative adversarial network model we explored improves the encoded speech quality.

Keywords:

Computer science Speech recognition Encoder Codec Encoding (memory) Set (abstract data type) Speech coding Decoding methods Test set PSQM Adversarial system Generative grammar Narrowband Artificial intelligence Speech processing Voice activity detection Algorithm Telecommunications

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.08

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Digital Media Forensic Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Speech Codec Enhancement With Generative Adversarial Networks

Abstract

Metrics

Citation History

Topics

Related Documents

Audio Codec Enhancement with Generative Adversarial Networks

Generative Adversarial Networks for Speech Enhancement

Speech Enhancement via Generative Adversarial LSTM Networks

HiFi-Stream: Streaming Speech Enhancement With Generative Adversarial Networks

Speech Enhancement with Topology-Enhanced Generative Adversarial Networks (GANs)