Binaural Source Separation with Convolutional Neural Networks

Erruz, Gerard

doi:10.5281/zenodo.1095835

ScienceGate Book Chapters

JOURNAL ARTICLE

Binaural Source Separation with Convolutional Neural Networks

Erruz, Gerard

Year: 2017 Journal: Zenodo (CERN European Organization for Nuclear Research) Publisher: European Organization for Nuclear Research

DOI: 10.5281/zenodo.1095835

Get Full-Text PDF Get Analytical Report

Abstract

This work is a study on source separation techniques for binaural music mixtures. The chosen framework uses a Convolutional Neural Network (CNN) to estimate time-frequency soft masks. This masks are used to extract the different sources from the original two-channel mixture signal. Its baseline single-channel architecture performed state-of-the-art results on monaural music mixtures under low-latency conditions. It has been extended to perform separation in two-channel signals, being the first two-channel CNN joint estimation architecture. This means that filters are learned for each source by taking in account both channels information. Furthermore, a specific binaural condition is included during training stage. It uses Interaural Level Difference (ILD) information to improve spatial images of extracted sources. Concurrently, we present a novel tool to create binaural scenes for testing purposes. Multiple binaural scenes are rendered from a music dataset of four instruments (voice, drums, bass and others). The CNN framework have been tested for these binaural scenes and compared with monaural and stereo results. The system showed a great amount of adaptability and good separation results in all the scenarios. These results are used to evaluate spatial information impact on separation performance.

Keywords:

Binaural recording Monaural Source separation Convolutional neural network Pattern recognition (psychology) Separation (statistics) Artificial neural network

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.35

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Blind Source Separation Techniques

Physical Sciences → Computer Science → Signal Processing

Hearing Loss and Rehabilitation

Life Sciences → Neuroscience → Cognitive Neuroscience

Binaural Source Separation with Convolutional Neural Networks

Abstract

Metrics

Topics

Related Documents

Binaural Source Separation with Convolutional Neural Networks

Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks

Monoaural Audio Source Separation Using Deep Convolutional Neural Networks

Low latency sound source separation using convolutional recurrent neural networks

Binaural Sound Source Localization Based on Convolutional Neural Network