JOURNAL ARTICLE

Neural Audio Decorrelation Using Generative Adversarial Networks

Abstract

The spatial perception of a sound image is significantly influenced by the degree of correlation between the sounds received by the ears. Audio signal decorrelation is, therefore, a commonly used tool in various spatial audio processing applications. In this paper, we propose a novel approach to audio decorrelation using generative adversarial networks. As generator, we employ a convolutional neural network architecture that has been recently proposed for audio decorrelation. In contrast to the previous work, the loss function is defined directly w.r.t. the input audio signal, i.e., a decorrelated reference signal is not required. The training objective includes a number of individual loss terms to control both the output-input correlation and the output signal quality. This enables specifically tailoring the training procedure to the desired output signal properties and possibly outperforming conventional decorrelation techniques in terms of performance and flexibility. The proposed approach is compared to a state-of-the-art conventional decorrelation method by means of objective evaluations as well as through listening tests, considering a variety of signal types.

Keywords:
Decorrelation Computer science Speech recognition Audio signal processing SIGNAL (programming language) Audio signal Artificial intelligence Signal processing Pattern recognition (psychology) Computer vision Speech coding Radar Telecommunications

Metrics

2
Cited By
0.54
FWCI (Field Weighted Citation Impact)
40
Refs
0.59
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.