DISSERTATION

Speech Enhancement using the Wave-U-Net with Spectral Losses

Jose David Bedoya Molina

Year: 2020 University:   Zenodo (CERN European Organization for Nuclear Research)   Publisher: European Organization for Nuclear Research

Abstract

Speech enhancement and source separation are related tasks that aim to extract and/or improve a signal of interest from a recording that may involve sounds from various sources, reverberation, and/or degradation of capture quality. Taking into account that the Wave-U-Net is an end-to-end deep learning architecture that has obtained relevant results for the source separation task operating in the time domain, this thesis studies the performance of this architecture for the speech enhancement task in terms of denoising, dereverberation, decoloration, and bandwidth extension. The experiments were conducted using a combination of a noisy version of the Voice Bank Corpus (VCTK) and the Device and Produced Speech dataset (DAPS). In addition to the original framework, variations inspired by relevant deep learning networks for speech enhancement were explored here, of which losses with spectral components presented the most favorable e˙ects for the improvement of low-quality speech signals. Also, the concatenation of the input audio with a noise vector in the network was shown to generate more coherent high-frequency content in the output signal.

Keywords:
Speech recognition Physics Computer science

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.26
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Infant Health and Development
Health Sciences →  Health Professions →  Pharmacy

Related Documents

JOURNAL ARTICLE

Speech Enhancement using the Wave-U-Net with Spectral Losses

Bedoya Molina, Jose David

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2020
JOURNAL ARTICLE

Performance analysis of speech enhancement using spectral gating with U-Net

Jharna AgrawalManish GuptaHitendra Garg

Journal:   Journal of Electrical Engineering Year: 2023 Vol: 74 (5)Pages: 365-373
JOURNAL ARTICLE

Speech Enhancement Using U-Net with Compressed Sensing

Kang ZhengZhihua HuangChenhua Lu

Journal:   Applied Sciences Year: 2022 Vol: 12 (9)Pages: 4161-4161
© 2026 ScienceGate Book Chapters — All rights reserved.