ZipEnhancer: Dual-Path Down-Up Sampling-based Zipformer for Monaural Speech Enhancement
Haoxu Wang, Biao Tian

TL;DR
ZipEnhancer is a dual-path Zipformer with time- and frequency-domain down-up sampling that reduces the computational cost of monaural speech enhancement, achieving state-of-the-art results with few parameters and low FLOPS.
Contribution
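The paper presents ZipEnhancer, a dual-path down-up sampling Zipformer architecture for efficient speech enhancement. It uses the ZipformerBlock as its core block, Dual-Path DownSampleStacks that symmetrically scale down and scale up, and training with the ScaleAdam optimizer and the Eden learning rate scheduler.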
Findings
Achieves state-of-the-art PESQ scores of 3.69 and 3.63 on the DNS 2020 Challenge and Voicebank+DEMAND datasets.
Uses only 2.04 million parameters and 62.41G FLOPS.
Outperforms models of similar complexity on speech enhancement.
Abstract
In contrast to other sequence tasks, which model hidden-layer features with three axes, Dual-Path time and time-frequency domain speech enhancement models are effective and have few parameters but are computationally demanding because their hidden-layer features have four axes. We propose ZipEnhancer, a Dual-Path Down-Up Sampling-based Zipformer for Monaural Speech Enhancement, which incorporates time- and frequency-domain Down-Up sampling to reduce computational cost. We introduce the ZipformerBlock as the core block and propose Dual-Path DownSampleStacks that symmetrically scale down and scale up. We also introduce the ScaleAdam optimizer and the Eden learning rate scheduler to further improve performance. Our model achieves new state-of-the-art results on the DNS 2020 Challenge and Voicebank+DEMAND datasets, with a perceptual evaluation of speech quality (PESQ) of…
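To make the cost argument concrete, here is a minimal NumPy sketch, not the authors' implementation. It assumes a four-axis feature tensor laid out as (batch, channels, time, frequency), uses average pooling and nearest-neighbour repetition as stand-ins for the model's down- and up-sampling, a plain residual connection, and a hypothetical `block` callable in place of the ZipformerBlock. The `attention_cost` function is only a rough proxy for dual-path self-attention cost (quadratic in the axis being attended over), not the paper's FLOPS count.

```python
import numpy as np


def downsample(x, factor, axis):
    """Average-pool x along `axis` by `factor`, padding with the last slice."""
    n = x.shape[axis]
    pad = (-n) % factor
    if pad:
        last = np.take(x, [n - 1], axis=axis)
        x = np.concatenate([x] + [last] * pad, axis=axis)
    x = np.moveaxis(x, axis, -1)
    x = x.reshape(x.shape[:-1] + (-1, factor)).mean(axis=-1)
    return np.moveaxis(x, -1, axis)


def upsample(x, factor, axis, length):
    """Repeat each slice `factor` times along `axis`, then trim to `length`."""
    y = np.repeat(x, factor, axis=axis)
    return np.take(y, np.arange(length), axis=axis)


def down_up_stage(x, t_factor, f_factor, block):
    """Run `block` at reduced time/frequency resolution, then restore the shape.

    x has the assumed layout (batch, channels, time, frequency).
    `block` is a hypothetical placeholder for the core block.
    """
    _, _, T, F = x.shape
    y = downsample(x, t_factor, axis=2)   # shrink the time axis
    y = downsample(y, f_factor, axis=3)   # shrink the frequency axis
    y = block(y)                          # cheap: fewer frames and bins
    y = upsample(y, f_factor, axis=3, length=F)
    y = upsample(y, t_factor, axis=2, length=T)
    return x + y                          # plain residual (an assumption)


def attention_cost(T, F):
    """Rough proxy: time path costs F * T^2, frequency path costs T * F^2."""
    return F * T**2 + T * F**2


x = np.random.default_rng(0).standard_normal((1, 4, 10, 7))
out = down_up_stage(x, t_factor=2, f_factor=2, block=lambda y: y)
ratio = attention_cost(100, 64) / attention_cost(50, 32)
```

Under these assumptions, halving both the time and frequency axes cuts the proxy cost by a factor of 8 while the stage still returns a tensor with the original four-axis shape, which is the intuition behind applying the core block inside symmetric down-up stacks.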
Taxonomy
Topics: Speech and Audio Processing · Speech Recognition and Synthesis · Infant Health and Development
