Speech enhancement and source separation are related tasks that aim to extract or improve a signal of interest from a recording that may contain sounds from multiple sources, reverberation, or degraded capture quality. Since the Wave-U-Net is an end-to-end deep learning architecture that has achieved strong results on the source separation task operating in the time domain, this thesis studies its performance on the speech enhancement task in terms of denoising, dereverberation, decoloration, and bandwidth extension. The experiments were conducted on a combination of a noisy version of the Voice Bank Corpus (VCTK) and the Device and Produced Speech (DAPS) dataset. In addition to the original framework, variations inspired by relevant deep learning networks for speech enhancement were explored, of which losses with spectral components had the most favorable effects on the improvement of low-quality speech signals. Concatenating the input audio with a noise vector in the network was also shown to generate more coherent high-frequency content in the output signal.
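The noise-vector concatenation mentioned above can be illustrated with a minimal sketch: the noise is appended as an extra input channel alongside the mono waveform before the network's first layer. This is only an assumed realization for illustration; the function name, shapes, and the choice of Gaussian noise are illustrative and not taken from the thesis itself.

```python
import numpy as np

def concat_noise_channel(audio, seed=None):
    """Append a Gaussian noise channel to a batch of mono waveforms.

    audio: array of shape (batch, 1, samples).
    Returns an array of shape (batch, 2, samples): the original audio
    plus a noise channel the network can draw on to synthesize
    high-frequency content. (Illustrative sketch, not the thesis code.)
    """
    rng = np.random.default_rng(seed)
    noise = rng.standard_normal(audio.shape).astype(audio.dtype)
    return np.concatenate([audio, noise], axis=1)

# Example: a batch of 4 one-second-ish mono excerpts.
batch = np.zeros((4, 1, 16384), dtype=np.float32)
x = concat_noise_channel(batch, seed=0)
print(x.shape)  # (4, 2, 16384)
```

The first convolutional layer of the network would then simply accept two input channels instead of one; the rest of the architecture is unchanged.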