DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing

Kuang Yuan; Shuo Han; Swarun Kumar; Bhiksha Raj

arXiv:2409.06137·eess.AS·September 11, 2024

DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing

Kuang Yuan, Shuo Han, Swarun Kumar, Bhiksha Raj

PDF

TL;DR

DeWinder introduces a multi-modal deep learning approach using ultrasound sensing to explicitly detect and reduce wind noise in single-channel outdoor speech recordings, significantly enhancing noise suppression performance.

Contribution

This work is the first to utilize ultrasound sensing as an auxiliary modality for wind noise reduction in speech enhancement.

Findings

01

Significant improvement over state-of-the-art noise reduction models.

02

Effective use of ultrasonic Doppler features for wind noise characterization.

03

Enhanced perceptual speech quality in outdoor environments.

Abstract

The quality of audio recordings in outdoor environments is often degraded by the presence of wind. Mitigating the impact of wind noise on the perceptual quality of single-channel speech remains a significant challenge due to its non-stationary characteristics. Prior work in noise suppression treats wind noise as a general background noise without explicit modeling of its characteristics. In this paper, we leverage ultrasound as an auxiliary modality to explicitly sense the airflow and characterize the wind noise. We propose a multi-modal deep-learning framework to fuse the ultrasonic Doppler features and speech signals for wind noise reduction. Our results show that DeWinder can significantly improve the noise reduction capabilities of state-of-the-art speech enhancement models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.