On the difference-to-sum power ratio of speech and wind noise based on the Corcos model
Daniele Mirabilii, Emanu\"el A.P. Habets

TL;DR
This paper introduces a generalized difference-to-sum power ratio based on the Corcos model for wind noise suppression and detection in speech signals, validated with real data and showing improved performance over existing methods.
Contribution
It presents a new formulation of the difference-to-sum power ratio incorporating the Corcos model for all air stream directions and lateral coherence decay, enhancing wind noise detection.
Findings
Validated with real microphone data
Improved wind noise detection performance
Generalized model for various airflow directions
Abstract
The difference-to-sum power ratio was proposed and used to suppress wind noise under specific acoustic conditions. In this contribution, a general formulation of the difference-to-sum power ratio associated with a mixture of speech and wind noise is proposed and analyzed. In particular, it is assumed that the complex coherence of convective turbulence can be modelled by the Corcos model. In contrast to the work in which the power ratio was first presented, the employed Corcos model holds for every possible air stream direction and takes into account the lateral coherence decay rate. The obtained expression is subsequently validated with real data for a dual microphone set-up. Finally, the difference-to- sum power ratio is exploited as a spatial feature to indicate the frame-wise presence of wind noise, obtaining improved detection performance when compared to an existing multi-channel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
