Unsupervised Vocal Dereverberation with Diffusion-Based Generative Models

Koichi Saito; Naoki Murata; Toshimitsu Uesaka; Chieh-Hsin Lai; Yuhta Takida; Takao Fukui; Yuki Mitsufuji

doi:10.1109/icassp49357.2023.10095761

ScienceGate Book Chapters

JOURNAL ARTICLE

Unsupervised Vocal Dereverberation with Diffusion-Based Generative Models

Koichi Saito Naoki Murata Toshimitsu Uesaka Chieh-Hsin Lai Yuhta Takida Takao Fukui Yuki Mitsufuji

Year: 2023 Pages: 1-5

DOI: 10.1109/icassp49357.2023.10095761

Get Full-Text PDF Get Analytical Report

Abstract

Removing reverb from reverberant music is a necessary technique to clean up audio for downstream music manipulations. Reverberation of music contains two categories, natural reverb, and artificial reverb. Artificial reverb has a wider diversity than natural reverb due to its various parameter setups and reverberation types. However, recent supervised dereverberation methods may fail because they rely on sufficiently diverse and numerous pairs of reverberant observations and retrieved data for training in order to be generalizable to unseen observations during inference. To resolve these problems, we propose an unsupervised method that can remove a general kind of artificial reverb for music without requiring pairs of data for training. The proposed method is based on diffusion models, where it initializes the unknown reverberation operator with a conventional signal processing technique and simultaneously refines the estimate with the help of diffusion models. We show through objective and perceptual evaluations that our method outperforms the current leading vocal dereverberation benchmarks.

Keywords:

Reverberation Computer science Speech recognition Generative model Artificial intelligence Inference Generative grammar Acoustics

Metrics

Cited By

2.68

FWCI (Field Weighted Citation Impact)

Refs

0.88

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Acoustic Wave Phenomena Research

Physical Sciences → Engineering → Biomedical Engineering

Unsupervised Vocal Dereverberation with Diffusion-Based Generative Models

Abstract

Metrics

Citation History

Topics

Related Documents

Speech Enhancement and Dereverberation With Diffusion-Based Generative Models

Unsupervised Speech Enhancement with Diffusion-Based Generative Models

BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models

Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation With Diffusion Models

Unsupervised Multi-channel Speech Dereverberation via Diffusion