JOURNAL ARTICLE

Joint Training of Deep Neural Networks for Multi-Channel Dereverberation and Speech Source Separation

Abstract

In this paper, we propose a joint training of two deep neural networks (DNNs) for dereverberation and speech source separation. The proposed method connects the first DNN, the dereverberation part, the second DNN, and the speech source separation part in a cascade manner. The proposed method does not train each DNN separately. Instead, an integrated loss function which evaluates an output signal after dereverberation and speech source separation is adopted. The proposed method estimates the output signal as a probabilistic variable. Recently, in the speech source separation context, we proposed a loss function which evaluates the estimated posterior probability density function (PDF) of the output signal. In this paper, we extend this loss function into a loss function which evaluates not only speech source separation performance but also speech derevereberation performance. Since the output signal of the dereverberation part is converted into the input feature of the second DNN, gradient of the loss function is back-propagated into the first DNN through the input feature of the second DNN. Experimental results show that the proposed joint training of two DNNs is effective. It is also shown that the posterior PDF based loss function is effective in the joint training context.

Keywords:
Computer science Source separation Speech recognition Joint (building) Context (archaeology) Artificial neural network SIGNAL (programming language) Reverberation Channel (broadcasting) Feature (linguistics) Function (biology) Probabilistic logic Artificial intelligence Acoustics Telecommunications Engineering

Metrics

8
Cited By
0.89
FWCI (Field Weighted Citation Impact)
33
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

Source Separation and Speech Dereverberation

Signals and communication technology Year: 2006 Pages: 319-351
BOOK-CHAPTER

Source Separation and Speech Dereverberation

Signals and communication technology Year: 2007 Pages: 319-351
JOURNAL ARTICLE

Audio-Visual Multi-Channel Speech Separation, Dereverberation and Recognition

Guinan LiJianwei YuJiajun DengXunying LiuHelen Meng

Journal:   ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Year: 2022 Pages: 6042-6046
© 2026 ScienceGate Book Chapters — All rights reserved.