CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning
Sjoerd Groot, Qinyu Chen, Jan C. van Gemert, Chang Gao

TL;DR
CleanUMamba introduces a compact, real-time neural network for speech denoising that uses channel pruning and a Mamba-based architecture to achieve high performance with minimal model size and computational cost.
Contribution
The paper proposes a novel U-Net based architecture with Mamba layers and structured channel pruning, enabling efficient, high-quality speech denoising in real-time with significantly reduced model size.
Findings
Achieves PESQ of 2.42 and STOI of 95.1% on Interspeech 2020 challenge
Reduces model size by 8X through channel pruning
Maintains real-time performance with only 442K parameters
Abstract
This paper presents CleanUMamba, a time-domain neural network architecture designed for real-time causal audio denoising directly applied to raw waveforms. CleanUMamba leverages a U-Net encoder-decoder structure, incorporating the Mamba state-space model in the bottleneck layer. By replacing conventional self-attention and LSTM mechanisms with Mamba, our architecture offers superior denoising performance while maintaining a constant memory footprint, enabling streaming operation. To enhance efficiency, we applied structured channel pruning, achieving an 8X reduction in model size without compromising audio quality. Our model demonstrates strong results in the Interspeech 2020 Deep Noise Suppression challenge. Specifically, CleanUMamba achieves a PESQ score of 2.42 and STOI of 95.1% with only 442K parameters and 468M MACs, matching or outperforming larger models in real-time performance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Data Compression Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Convolution · Sigmoid Activation · Max Pooling · U-Net · Tanh Activation · Long Short-Term Memory · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
