Sinusoidal masks for single channel speech separation

Pejman Mowlaee; Mads Græsbøll Christensen; Søren Holdt Jensen

doi:10.1109/icassp.2010.5495679

ScienceGate Book Chapters

JOURNAL ARTICLE

Sinusoidal masks for single channel speech separation

Pejman Mowlaee Mads Græsbøll Christensen Søren Holdt Jensen

Year: 2010 Pages: 4262-4265

DOI: 10.1109/icassp.2010.5495679

Get Full-Text PDF Get Analytical Report

Abstract

In this paper we present a new approach for binary and soft masks
used in single-channel speech separation. We present a novel approach
called the sinusoidal mask (binary mask and Wiener filter)
in a sinusoidal space. Theoretical analysis is presented for the proposed
method, and we show that the proposed method is able to minimize
the target speech distortion while suppressing the crosstalk to
a predetermined threshold. It is observed that compared to the STFTbased
masks, the proposed sinusoidal masks improve the separation
performance in terms of objective measures (SSNR and PESQ) and
are mostly preferred by listeners.

Keywords:

PESQ Binary number Computer science Speech recognition Crosstalk Distortion (music) Channel (broadcasting) Filter (signal processing) Source separation Speech enhancement Algorithm Electronic engineering Computer vision Mathematics Engineering Telecommunications

Metrics

Cited By

2.67

FWCI (Field Weighted Citation Impact)

Refs

0.91

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Advanced Adaptive Filtering Techniques

Physical Sciences → Engineering → Computational Mechanics

Blind Source Separation Techniques

Physical Sciences → Computer Science → Signal Processing

Sinusoidal masks for single channel speech separation

Abstract

Metrics

Citation History

Topics

Related Documents

Single channel speech separation based on sinusoidal modeling

Improved single-channel speech separation using sinusoidal modeling

Single-channel music/speech separation using non-linear masks

New Results on Single-Channel Speech Separation Using Sinusoidal Modeling

Single channel speech-music separation using matching pursuit and spectral masks