Design of Medium to Low Bitrate Neural Audio Codec

Samarpreet Singh; Saurabh Singh Raghuvanshi; Vinal Patel

doi:10.1109/i2ct57861.2023.10126323

ScienceGate Book Chapters

JOURNAL ARTICLE

Design of Medium to Low Bitrate Neural Audio Codec

Samarpreet Singh Saurabh Singh Raghuvanshi Vinal Patel

Year: 2023 Vol: 182 Pages: 1-5

DOI: 10.1109/i2ct57861.2023.10126323

Get Full-Text PDF Get Analytical Report

Abstract

Neural audio codecs are the most recent development in the field of audio compression. Traditional audio codecs rely on fixed signal processing pipelines and require domain-specific expertise to produce high-quality audio at low to high bit rates. However, the performance of conventional audio codecs usually degrades at low bit rates. Neural audio codecs perform enhancement and compression with no added latency. This paper further enhances the quality of neural audio codecs by integrating a psychoacoustic model with the existing structure that contains a convolutional encoder, decoder, and a residual vector quantizer. It used a combination of reconstruction and adversarial loss to train the model to generate high-quality audio content. Audio quality measures like PEAQ and MUSHRA are conducted to illustrate that the proposed model performs better than the existing model of neural audio codec.

Keywords:

Codec Computer science Speech coding Speech recognition Sound quality Encoder Audio signal Adaptive Multi-Rate audio codec Psychoacoustics Data compression Artificial intelligence Speech processing Computer hardware Voice activity detection

Metrics

Cited By

0.81

FWCI (Field Weighted Citation Impact)

Refs

0.65

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Image and Signal Denoising Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Design of Medium to Low Bitrate Neural Audio Codec

Abstract

Metrics

Citation History

Topics

Related Documents

An Ultra-Low Bitrate Neural Audio Codec Under NB-IoT

Neural Audio Codec

Efficient stereo bitrate allocation for fully scalable audio codec

LSPnet: an ultra-low bitrate hybrid neural codec

AI-Based Bitrate Selection Method for CMR-Supported Audio Codec