Monaural Music Source Separation Using Deep Convolutional Neural Network Embedded with Feature Extraction Module

Yongbin Yu; Chenhui Peng; Qian Tang; Xiangxiang Wang

doi:10.1109/cacml55074.2022.00098

Year: 2022

Journal: 2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)

Vol: abs 1806 3185

Pages: 546-551

Abstract

In this paper, we propose a novel deep convolutional neural network (DCNN) embedded with our feature extraction module (FEM) for monaural music source separation. UNet++ is introduced into the FEM for highly flexible feature fusion. First, an improved encoder-decoder preliminarily extracts multi-scale features from the magnitude spectrogram of the music mixture. The FEM then refines these into fine features at different scales, and soft masks are finally generated to separate each source. The proposed network can capture the main features of multi-scale spectrogram images and make use of the parameters it has learned. We conducted experiments on the MIR-1K and DSD100 datasets. Compared with state-of-the-art methods in singing voice separation and source separation tasks, our network achieved outstanding performance on MIR-1K and competitive results on DSD100.
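The final step described in the abstract, applying soft masks to the mixture's magnitude spectrogram, can be illustrated with a minimal sketch. The abstract does not state how the masks are computed, so the ratio-mask formula below is an assumption (a commonly used formulation), and the names `soft_masks` and `separate` are hypothetical. Random arrays stand in for the network's per-source outputs.

```python
import numpy as np

def soft_masks(source_estimates, eps=1e-8):
    """Turn non-negative per-source magnitude estimates into ratio masks.

    source_estimates: array of shape (n_sources, n_freq, n_frames).
    Returns masks of the same shape with values in [0, 1] that sum to
    approximately 1 across sources in every time-frequency bin.
    NOTE: the ratio-mask form is an assumption, not taken from the paper.
    """
    est = np.abs(source_estimates)
    return est / (est.sum(axis=0, keepdims=True) + eps)

def separate(mixture_mag, source_estimates):
    """Apply the soft masks to the mixture magnitude spectrogram."""
    masks = soft_masks(source_estimates)
    # Broadcast the (n_freq, n_frames) mixture over the source axis.
    return masks * mixture_mag[np.newaxis, ...]

# Toy data standing in for network outputs (hypothetical shapes).
rng = np.random.default_rng(0)
vocals = rng.random((513, 100))
accomp = rng.random((513, 100))
mixture_mag = vocals + accomp  # toy additive mixture of magnitudes
noisy_estimates = np.stack([vocals, accomp]) + 0.05 * rng.random((2, 513, 100))

separated = separate(mixture_mag, noisy_estimates)
```

Because the masks sum to roughly one in each bin, the separated magnitudes add back up to the mixture magnitude, which is the usual motivation for soft masking over predicting each source independently.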

Keywords:

Computer science; Spectrogram; Source separation; Feature extraction; Convolutional neural network; Pattern recognition (psychology); Monaural; Feature (linguistics); Artificial intelligence; Encoder; Artificial neural network; Speech recognition; Deep learning

Metrics

FWCI (Field Weighted Citation Impact): 0.42

Citation Normalized Percentile: 0.51

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Blind Source Separation Techniques

Physical Sciences → Computer Science → Signal Processing

Related Documents

Monaural Music Source Separation Using Convolutional Sparse Coding

Monaural Score-Informed Source Separation For Classical Music Using Convolutional Neural Networks.

Monaural Music-Speech Source Separation Based on Convolutional Neural Network for Background Music Identification in TV Shows

Music Source Separation with Deep Convolution Neural Network