Inference and Denoise: Causal Inference-based Neural Speech Enhancement
Tsun-An Hsieh, Chao-Han Huck Yang, Pin-Yu Chen, Sabato Marco, Siniscalchi, Yu Tsao

TL;DR
This paper introduces a causal inference framework for speech enhancement that models noise as an intervention, leading to improved performance and efficiency over traditional methods.
Contribution
It proposes a novel causal inference-based speech enhancement method that uses noise as an intervention and a noise detector to improve denoising accuracy.
Findings
CISE outperforms non-causal mask-based SE methods.
CISE achieves better performance and efficiency than complex SE models.
A SE-specific average treatment effect is derived for causal quantification.
Abstract
This study addresses the speech enhancement (SE) task within the causal inference paradigm by modeling the noise presence as an intervention. Based on the potential outcome framework, the proposed causal inference-based speech enhancement (CISE) separates clean and noisy frames in an intervened noisy speech using a noise detector and assigns both sets of frames to two mask-based enhancement modules (EMs) to perform noise-conditional SE. Specifically, we use the presence of noise as guidance for EM selection during training, and the noise detector selects the enhancement module according to the prediction of the presence of noise for each frame. Moreover, we derived a SE-specific average treatment effect to quantify the causal effect adequately. Experimental evidence demonstrates that CISE outperforms a non-causal mask-based SE approach in the studied settings and has better performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Speech Recognition and Synthesis
