Catalog-based single-channel speech-music separation for automatic speech recognition

Cemil Demir; Ali Taylan Cemgil; Murat Saraçlar

doi:10.1109/siu.2012.6204782

ScienceGate Book Chapters

JOURNAL ARTICLE

Catalog-based single-channel speech-music separation for automatic speech recognition

Cemil Demir Ali Taylan Cemgil Murat Saraçlar Murat Saraçlar

Year: 2012 Pages: 1-4

DOI: 10.1109/siu.2012.6204782

Get Full-Text PDF Get Analytical Report

Abstract

In this study, we analyze the effect of the catalog-based single-channel speech-music separation method, which we proposed previously, on speech recognition performance. In the proposed method, assuming that we know a catalog of the background music, we developed a generative model for the superposed speech and music spectrograms. We represent the speech spectrogram by a Non-negative Matrix Factorization (NMF) model and the music spectrogram by a conditional Poisson Mixture Model (PMM). In this paper, we propose to recover the speech signals from the mixed signal in time-domain by detecting the active catalog frames using the catalog-based method. We compare the performances of 3 different signal reconstruction techniques; Expectation-Based, Posterior-Based and Time-Domain reconstruction. Moreover, we compare the performance of our system with the performance of the traditional NMF model. Our method outperforms the NMF method in ASR performance and separation performance in most experimental conditions.

Keywords:

Spectrogram Non-negative matrix factorization Computer science Speech recognition Source separation Hidden Markov model Artificial intelligence Channel (broadcasting) Speech enhancement SIGNAL (programming language) Speech processing Pattern recognition (psychology) Matrix decomposition

Metrics

Cited By

0.49

FWCI (Field Weighted Citation Impact)

Refs

0.63

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Blind Source Separation Techniques

Physical Sciences → Computer Science → Signal Processing

Catalog-based single-channel speech-music separation for automatic speech recognition

Abstract

Metrics

Citation History

Topics

Related Documents

Catalog-based single-channel speech-music separation

Single-channel speech-music separation using NMF for automatic speech recognition

Semi-supervised single-channel speech-music separation for automatic speech recognition

Gain estimation approaches in catalog-based single-channel speech-music separation

Analysis of effect of single-channel speech-music separation using NMF to automatic speech recognition