In this paper we present a new approach for binary and soft masks
used in single-channel speech separation. We present a novel approach
called the sinusoidal mask (binary mask and Wiener filter)
in a sinusoidal space. Theoretical analysis is presented for the proposed
method, and we show that the proposed method is able to minimize
the target speech distortion while suppressing the crosstalk to
a predetermined threshold. It is observed that compared to the STFTbased
masks, the proposed sinusoidal masks improve the separation
performance in terms of objective measures (SSNR and PESQ) and
are mostly preferred by listeners.
Belhedi WiemMohamed Anouar Ben MessaoudAïcha Bouzid
Pejman MowlaeeMads Græsbøll ChristensenSøren Holdt Jensen
Pejman MowlaeeAbolghasem SayadianMansour SheikhanMahdi Fallah
Pejman MowlaeeMads Græsbøll ChristensenSøren Holdt Jensen