JOURNAL ARTICLE

Monaural Music Source Separation Using Deep Convolutional Neural Network Embedded with Feature Extraction Module

Yongbin YuChenhui PengQian TangXiangxiang Wang

Year: 2022 Journal:   2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML) Vol: abs 1806 3185 Pages: 546-551

Abstract

In this paper, we propose a novel deep convo-lutional neural network (DCNN) embedded with our feature extraction module (FEM), for monaural music source separation. The UN et ++ is introduced into our FEM for highly flexible feature fusion. At first, an improved encoder-decoder is designed to preliminarily extract multi-scale features from a magnitude spectrogram of the mixture music. Then we use the FEM to further obtain fine features in different scales, and soft masks are finally generated for the separation of each source. The proposed network can capture the main features of multi-scale spectrogram images and make use of the parameters it has learned. We conducted experiments on the MIR-IK dataset and the DSD100 datasets. Our network achieved outstanding performance on the MIR-1K dataset and acquired competitive results on the DSD100 dataset compared with state-of-the-art methods in singing voice separation and source separation tasks.

Keywords:
Computer science Spectrogram Source separation Feature extraction Convolutional neural network Pattern recognition (psychology) Monaural Feature (linguistics) Artificial intelligence Encoder Artificial neural network Speech recognition Deep learning

Metrics

3
Cited By
0.42
FWCI (Field Weighted Citation Impact)
39
Refs
0.51
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Monaural Music Source Separation Using Convolutional Sparse Coding

Ping-Keng JaoLi SuYi‐Hsuan YangBrendt Wohlberg

Journal:   IEEE/ACM Transactions on Audio Speech and Language Processing Year: 2016 Vol: 24 (11)Pages: 2158-2170
JOURNAL ARTICLE

Monaural Score-Informed Source Separation For Classical Music Using Convolutional Neural Networks.

Marius MironJordi JanerEmília Gómez

Journal:   International Symposium/Conference on Music Information Retrieval Year: 2017 Pages: 55-62
JOURNAL ARTICLE

Monaural Score-Informed Source Separation For Classical Music Using Convolutional Neural Networks.

Miron, MariusJaner, JordiGómez, Emilia

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2017
JOURNAL ARTICLE

Monaural Music-Speech Source Separation Based on Convolutional Neural Network for Background Music Identification in TV Shows

Hyemi KimwWoon-Haeng HeoJung-Hyun KimJihyun Park

Journal:   The Journal of Korean Institute of Communications and Information Sciences Year: 2020 Vol: 45 (5)Pages: 855-866
BOOK-CHAPTER

Music Source Separation with Deep Convolution Neural Network

Patil MangalRenuka Deolalikar

Lecture notes in networks and systems Year: 2022 Pages: 199-206
© 2026 ScienceGate Book Chapters — All rights reserved.