Unsupervised vocal dereverberation with diffusion-based generative   models

Koichi Saito; Naoki Murata; Toshimitsu Uesaka; Chieh-Hsin Lai; Yuhta; Takida; Takao Fukui; Yuki Mitsufuji

arXiv:2211.04124·eess.AS·November 9, 2022·1 cites

Unsupervised vocal dereverberation with diffusion-based generative models

Koichi Saito, Naoki Murata, Toshimitsu Uesaka, Chieh-Hsin Lai, Yuhta, Takida, Takao Fukui, Yuki Mitsufuji

PDF

Open Access

TL;DR

This paper introduces an unsupervised diffusion-based approach to remove artificial reverb from music recordings, overcoming the limitations of supervised methods that require paired training data.

Contribution

It presents a novel unsupervised dereverberation technique that combines traditional signal processing with diffusion models, capable of handling diverse artificial reverb without paired training data.

Findings

01

Outperforms current vocal dereverberation benchmarks in objective evaluations.

02

Effectively removes a wide range of artificial reverb types.

03

Operates without requiring paired reverberant and clean data.

Abstract

Removing reverb from reverberant music is a necessary technique to clean up audio for downstream music manipulations. Reverberation of music contains two categories, natural reverb, and artificial reverb. Artificial reverb has a wider diversity than natural reverb due to its various parameter setups and reverberation types. However, recent supervised dereverberation methods may fail because they rely on sufficiently diverse and numerous pairs of reverberant observations and retrieved data for training in order to be generalizable to unseen observations during inference. To resolve these problems, we propose an unsupervised method that can remove a general kind of artificial reverb for music without requiring pairs of data for training. The proposed method is based on diffusion models, where it initializes the unknown reverberation operator with a conventional signal processing technique…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Acoustic Wave Phenomena Research

Methodsfail · Diffusion