CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning

Sjoerd Groot; Qinyu Chen; Jan C. van Gemert; Chang Gao

arXiv:2410.11062·cs.SD·July 3, 2025

CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning

Sjoerd Groot, Qinyu Chen, Jan C. van Gemert, Chang Gao

PDF

Open Access 1 Repo

TL;DR

CleanUMamba introduces a compact, real-time neural network for speech denoising that uses channel pruning and a Mamba-based architecture to achieve high performance with minimal model size and computational cost.

Contribution

The paper proposes a novel U-Net based architecture with Mamba layers and structured channel pruning, enabling efficient, high-quality speech denoising in real-time with significantly reduced model size.

Findings

01

Achieves PESQ of 2.42 and STOI of 95.1% on Interspeech 2020 challenge

02

Reduces model size by 8X through channel pruning

03

Maintains real-time performance with only 442K parameters

Abstract

This paper presents CleanUMamba, a time-domain neural network architecture designed for real-time causal audio denoising directly applied to raw waveforms. CleanUMamba leverages a U-Net encoder-decoder structure, incorporating the Mamba state-space model in the bottleneck layer. By replacing conventional self-attention and LSTM mechanisms with Mamba, our architecture offers superior denoising performance while maintaining a constant memory footprint, enabling streaming operation. To enhance efficiency, we applied structured channel pruning, achieving an 8X reduction in model size without compromising audio quality. Our model demonstrates strong results in the Interspeech 2020 Deep Noise Suppression challenge. Specifically, CleanUMamba achieves a PESQ score of 2.42 and STOI of 95.1% with only 442K parameters and 468M MACs, matching or outperforming larger models in real-time performance.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lab-emi/cleanumamba
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Data Compression Techniques

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Convolution · Sigmoid Activation · Max Pooling · U-Net · Tanh Activation · Long Short-Term Memory · Mamba: Linear-Time Sequence Modeling with Selective State Spaces