U-Mamba-Net: A highly efficient Mamba-based U-net style network for noisy and reverberant speech separation
Shaoxiang Dang, Tetsuya Matsumoto, Yoshinori Takeuchi, Hiroaki Kudo

TL;DR
U-Mamba-Net is a lightweight, efficient U-Net style neural network utilizing Mamba for feature filtering, designed for effective speech separation in noisy and reverberant environments with low computational demands.
Contribution
The paper introduces U-Mamba-Net, a novel lightweight model combining Mamba and U-Net architectures for efficient speech separation in complex acoustic conditions.
Findings
Achieves improved speech separation performance on Libri2mix
Maintains low computational cost compared to existing models
Demonstrates effectiveness in noisy and reverberant environments
Abstract
The topic of speech separation involves separating mixed speech with multiple overlapping speakers into several streams, with each stream containing speech from only one speaker. Many highly effective models have emerged and proliferated rapidly over time. However, the size and computational load of these models have also increased accordingly. This is a disaster for the community, as researchers need more time and computational resources to reproduce and compare existing models. In this paper, we propose U-mamba-net: a lightweight Mamba-based U-style model for speech separation in complex environments. Mamba is a state space sequence model that incorporates feature selection capabilities. U-style network is a fully convolutional neural network whose symmetric contracting and expansive paths are able to learn multi-resolution features. In our work, Mamba serves as a feature filter,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis
MethodsConcatenated Skip Connection · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · U-Net · Mamba: Linear-Time Sequence Modeling with Selective State Spaces · Feature Selection
