Improving Design of Input Condition Invariant Speech Enhancement
Wangyou Zhang, Jee-weon Jung, Shinji Watanabe, Yanmin Qian

TL;DR
This paper introduces novel architectures and training strategies to improve input condition invariant speech enhancement, achieving better real-world performance while maintaining efficiency in simulated conditions.
Contribution
The paper proposes redesigned modules and dual-path blocks for input condition invariant speech enhancement, significantly improving real-world robustness and efficiency.
Findings
Enhanced real-condition performance in speech enhancement
Reduced computational costs with novel dual-path blocks
Effective two-stage training strategy improves generalization
Abstract
Building a single universal speech enhancement (SE) system that can handle arbitrary input is a demanded but underexplored research topic. Towards this ultimate goal, one direction is to build a single model that handles diverse audio duration, sampling frequencies, and microphone variations in noisy and reverberant scenarios, which we define here as "input condition invariant SE". Such a model was recently proposed showing promising performance; however, its multi-channel performance degraded severely in real conditions. In this paper we propose novel architectures to improve the input condition invariant SE model so that performance in simulated conditions remains competitive while real condition degradation is much mitigated. For this purpose, we redesign the key components that comprise such a system. First, we identify that the channel-modeling module's generalization to unseen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Speech Recognition and Synthesis
