Task-aware Warping Factors in Mask-based Speech Enhancement
Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka, Koji Okabe, Hitoshi Yamamoto

TL;DR
This paper introduces task-aware warping factors in mask-based speech enhancement to optimize performance across multiple downstream tasks like speech quality, speaker verification, and speech recognition, without task-specific training.
Contribution
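It proposes a novel dual warping-factor approach that lets a single speech enhancement system serve multiple tasks: one factor sets the balance between speech maintenance and noise removal during training, and the other sets how strongly enhancement is applied for a given downstream task at test time.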
Findings
Speech quality (PESQ) improved by 84.7% at 0 dB
Speaker verification EER reduced by 22.4%
Speech recognition WER reduced by 52.2%
Abstract
This paper proposes the use of two task-aware warping factors in mask-based speech enhancement (SE). One controls the balance between speech maintenance and noise removal in the training phase, while the other controls the SE power applied to a specific downstream task in the testing phase. Our intention is to alleviate the problem that SE systems trained to improve speech quality often fail to improve other downstream tasks, such as automatic speaker verification (ASV) and automatic speech recognition (ASR), because they do not share the same objectives. The proposed dual warping-factor approach is easy to apply to any mask-based SE method, and it allows a single SE system to handle multiple tasks without task-dependent training. The effectiveness of our proposed approach has been confirmed on the SITW dataset for ASV evaluation and the LibriSpeech dataset for ASR and speech quality evaluations…
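To make the test-time factor concrete, the sketch below shows one way a single scalar could control how strongly a predicted mask is applied. The abstract does not give the functional form, so the power-law warping used here, along with the names `warp_mask`, `enhance`, and `beta`, is an assumption for illustration only and may differ from the paper's formulation.

```python
import numpy as np

def warp_mask(mask, beta):
    """Hypothetical test-time warping: raise a [0, 1] mask to the power beta.

    beta = 1 leaves the mask unchanged, beta < 1 pushes values toward 1
    (milder enhancement), and beta = 0 disables enhancement entirely.
    """
    mask = np.clip(mask, 0.0, 1.0)
    return mask ** beta

def enhance(noisy_mag, mask, beta=1.0):
    """Apply the warped mask to a noisy magnitude spectrogram."""
    return warp_mask(mask, beta) * noisy_mag

# Toy 2x2 "spectrogram" and mask (made-up values, not data from the paper).
noisy_mag = np.array([[1.0, 2.0], [4.0, 0.5]])
mask = np.array([[0.25, 1.0], [0.81, 0.0]])

full = enhance(noisy_mag, mask, beta=1.0)   # mask applied as predicted
mild = enhance(noisy_mag, mask, beta=0.5)   # softer suppression
off = enhance(noisy_mag, mask, beta=0.0)    # input passes through unchanged
```

Under this assumed form, the same trained model can be run with a different `beta` for each downstream task, which matches the abstract's claim that no task-dependent training is needed.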
Taxonomy
Topics: Speech and Audio Processing · Speech Recognition and Synthesis · Indoor and Outdoor Localization Technologies
