LightCodec: A High Fidelity Neural Audio Codec with Low Computation Complexity

Liang Xu; Jing Wang; Jianqian Zhang; Xiang Xie

doi:10.1109/icassp48485.2024.10447532

JOURNAL ARTICLE

Year: 2024 Pages: 586-590

Abstract

The audio codec is a core module in real-time audio communication. With the development of neural networks, end-to-end audio codecs have emerged and outperformed conventional codecs. However, current neural codecs suffer from high computational complexity, and their performance degrades rapidly when complexity is reduced, which hinders deployment on devices with limited computational resources. In this paper, we propose a low-complexity audio codec. To achieve high quality at low complexity, we design a structure based on frequency band division, implemented with a within-band-across-band interaction (WBABI) module that learns features both within and across subbands. Further, we propose a new quantization-compensation module, which reduces the quantization error by 90%. Experimental results show that, for audio sampled at 24 kHz, the model performs excellently at 3 to 6 kbps compared with other codecs, with a complexity of only 0.8 Giga Multiply-Add Operations per Second (GMACs).
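To put the reported operating range in perspective, the following is a minimal sketch of the bitrate arithmetic implied by the abstract. The 24 kHz sample rate and the 3 and 6 kbps bitrates come from the abstract; the 16-bit mono PCM reference and the function names are assumptions introduced only for illustration and are not taken from the paper.

```python
SAMPLE_RATE_HZ = 24_000    # sample rate stated in the abstract
PCM_BITS_PER_SAMPLE = 16   # assumption: 16-bit mono PCM reference (not stated in the abstract)


def bits_per_sample(bitrate_kbps: float, sample_rate_hz: int = SAMPLE_RATE_HZ) -> float:
    """Average number of coded bits spent on each audio sample."""
    return bitrate_kbps * 1000 / sample_rate_hz


def compression_ratio(
    bitrate_kbps: float,
    sample_rate_hz: int = SAMPLE_RATE_HZ,
    pcm_bits: int = PCM_BITS_PER_SAMPLE,
) -> float:
    """Ratio of the assumed uncompressed PCM bitrate to the codec bitrate."""
    pcm_kbps = sample_rate_hz * pcm_bits / 1000
    return pcm_kbps / bitrate_kbps


if __name__ == "__main__":
    for kbps in (3, 6):
        print(
            f"{kbps} kbps: {bits_per_sample(kbps):.3f} bits/sample, "
            f"{compression_ratio(kbps):.0f}x smaller than 16-bit PCM"
        )
```

Under these assumptions, 3 kbps corresponds to 0.125 bits per sample, roughly a 128-fold reduction relative to uncompressed 16-bit mono audio at the same sample rate.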

Keywords:

Codec; Computer science; Adaptive Multi-Rate audio codec; Computational complexity theory; Quantization (signal processing); Speech coding; Sound quality; Speech recognition; High fidelity; Computer engineering; Fidelity; Speech processing; Algorithm; Computer hardware; Telecommunications; Voice activity detection; Engineering

Metrics

Cited By

FWCI (Field Weighted Citation Impact): 6.41

Refs

Citation Normalized Percentile: 0.94

Is in top 1%

Is in top 10%

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

HILCodec: High-Fidelity and Lightweight Neural Audio Codec

TFF-Codec: A High Fidelity End-to-End Neural Audio Codec

Audiodec: An Open-Source Streaming High-Fidelity Neural Audio Codec

A high-fidelity speech and audio codec with low delay and low complexity

High-Fidelity Diffusion-Based Audio Codec