Single channel speech music separation using nonnegative matrix factorization and spectral masks

Emad M. Grais; Hakan Erdoğan

doi:10.1109/icdsp.2011.6004924

ScienceGate Book Chapters

JOURNAL ARTICLE

Single channel speech music separation using nonnegative matrix factorization and spectral masks

Emad M. Grais Hakan Erdoğan

Year: 2011 Pages: 1-6

DOI: 10.1109/icdsp.2011.6004924

Get Full-Text PDF Get Analytical Report

Abstract

A single channel speech-music separation algorithm based on nonnegative matrix factorization (NMF) with spectral masks is proposed in this work. The proposed algorithm uses training data of speech and music signals with nonnegative matrix factorization followed by masking to separate the mixed signal. In the training stage, NMF uses the training data to train a set of basis vectors for each source. These bases are trained using NMF in the magnitude spectrum domain. After observing the mixed signal, NMF is used to decompose its magnitude spectra into a linear combination of the trained bases for both sources. The decomposition results are used to build a mask, which explains the contribution of each source in the mixed signal. Experimental results show that using masks after NMF improves the separation process even when calculating NMF with fewer iterations, which yields a faster separation process. © 2011 IEEE.

Keywords:

Non-negative matrix factorization Matrix decomposition Source separation Computer science Masking (illustration) Speech recognition Blind signal separation Pattern recognition (psychology) Set (abstract data type) Matrix (chemical analysis) Channel (broadcasting) SIGNAL (programming language) Algorithm Artificial intelligence

Metrics

Cited By

6.21

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Advanced Adaptive Filtering Techniques

Physical Sciences → Engineering → Computational Mechanics

Blind Source Separation Techniques

Physical Sciences → Computer Science → Signal Processing

Single channel speech music separation using nonnegative matrix factorization and spectral masks

Abstract

Metrics

Citation History

Topics

Related Documents

Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks

Single channel speech-music separation using matching pursuit and spectral masks

Single channel music and speech separation using non-negative matrix factorization

Nonnegative matrix factorization with disjointness constraints for single channel speech separation

Single-channel music/speech separation using non-linear masks