JOURNAL ARTICLE

Design of Medium to Low Bitrate Neural Audio Codec

Abstract

Neural audio codecs are the most recent development in the field of audio compression. Traditional audio codecs rely on fixed signal processing pipelines and require domain-specific expertise to produce high-quality audio at low to high bit rates. However, the performance of conventional audio codecs usually degrades at low bit rates. Neural audio codecs perform enhancement and compression with no added latency. This paper further enhances the quality of neural audio codecs by integrating a psychoacoustic model with the existing structure that contains a convolutional encoder, decoder, and a residual vector quantizer. It used a combination of reconstruction and adversarial loss to train the model to generate high-quality audio content. Audio quality measures like PEAQ and MUSHRA are conducted to illustrate that the proposed model performs better than the existing model of neural audio codec.

Keywords:
Codec Computer science Speech coding Speech recognition Sound quality Encoder Audio signal Adaptive Multi-Rate audio codec Psychoacoustics Data compression Artificial intelligence Speech processing Computer hardware Voice activity detection

Metrics

3
Cited By
0.81
FWCI (Field Weighted Citation Impact)
21
Refs
0.65
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.