Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model
Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann

TL;DR
This paper introduces a diffusion-based stochastic model for single-channel wind noise reduction that outperforms existing neural network methods and generalizes well to unseen real-world data.
Contribution
The paper proposes a novel diffusion-based stochastic regeneration model that effectively reduces wind noise and accounts for non-linear deformations caused by wind flow.
Findings
Outperforms other neural-network-based wind noise reduction methods
Generalizes well to unseen real-recorded wind noise datasets
Demonstrates effectiveness on both simulated and real wind noise data
Abstract
In this paper we present a method for single-channel wind noise reduction using our previously proposed diffusion-based stochastic regeneration model combining predictive and generative modelling. We introduce a non-additive speech in noise model to account for the non-linear deformation of the membrane caused by the wind flow and possible clipping. We show that our stochastic regeneration model outperforms other neural-network-based wind noise reduction methods as well as purely predictive and generative models, on a dataset using simulated and real-recorded wind noise. We further show that the proposed method generalizes well by testing on an unseen dataset with real-recorded wind noise. Audio samples, data generation scripts and code for the proposed methods can be found online (https://uhh.de/inf-sp-storm-wind).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music Technology and Sound Studies · Music and Audio Processing
