JOURNAL ARTICLE

Neural Audio Coding with Deep Complex Networks

Jiawei RuLizhong WangMaoshen JiaLiang WenHandong WangYuhao ZhaoJing Wang

Year: 2024 Journal:   Journal of Physics Conference Series Vol: 2759 (1)Pages: 012005-012005   Publisher: IOP Publishing

Abstract

Abstract This paper proposes a transform domain audio coding method based on deep complex networks. In the proposed codec, the time-frequency spectrum of the audio signal is fed to the encoder which consists of complex convolutional blocks and a frequency-temporal modeling module to obtain the extracted features which are then quantized with a target bitrate by the vector quantizer. The structure of the decoder which reconstruct the time-frequency spectrum of the audio from quantized features is symmetrical to the encoder. In this paper, a structure combining the complex multi-head self-attention module and the complex long short-term memory is proposed to capture both frequency and temporal dependencies. Subjective and objective evaluation tests show the advantage of the proposed method.

Keywords:
Computer science Coding (social sciences) Artificial neural network Deep neural networks Speech recognition Artificial intelligence Mathematics

Metrics

1
Cited By
0.71
FWCI (Field Weighted Citation Impact)
9
Refs
0.54
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

End-to-end Stereo Audio Coding Using Deep Neural Networks

Wootaek LımInseon JangSeungkwon BeackJongmo SungTae‐Jin Lee

Journal:   2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) Year: 2022 Pages: 860-864
JOURNAL ARTICLE

Audio classification with complex neural networks

I. М. ChernenkiyN N TrufanovD. P. EgorovO. V. Kravchenko

Journal:   AIP conference proceedings Year: 2023 Vol: 2819 Pages: 030003-030003
JOURNAL ARTICLE

Audio Representation Learning with Deep Neural Networks

Mohammad Rasool Izadi

Journal:   OPAL (Open@LaTrobe) (La Trobe University) Year: 2023
JOURNAL ARTICLE

TDSNN: From Deep Neural Networks to Deep Spike Neural Networks with Temporal-Coding

Lei ZhangShengyuan ZhouTian ZhiZidong DuYunji Chen

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2019 Vol: 33 (01)Pages: 1319-1326
JOURNAL ARTICLE

Multichannel Audio Source Separation With Deep Neural Networks

Aditya Arie NugrahaAntoine LiutkusEmmanuel Vincent

Journal:   IEEE/ACM Transactions on Audio Speech and Language Processing Year: 2016 Vol: 24 (9)Pages: 1652-1664
© 2026 ScienceGate Book Chapters — All rights reserved.