xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement
Nikolai Lund K\"uhne, Jan {\O}stergaard, Jesper Jensen, Zheng-Hua Tan

TL;DR
This paper introduces xLSTM-SENet, a novel xLSTM-based single-channel speech enhancement system that matches or surpasses state-of-the-art models like Conformers in performance, while offering better scalability.
Contribution
First application of xLSTM architecture to speech enhancement, demonstrating competitive performance and revealing key architectural benefits through ablation studies.
Findings
xLSTM-SENet outperforms similar complexity models on Voicebank+Demand.
xLSTM-based models match or exceed state-of-the-art performance.
Architectural choices like exponential gating improve effectiveness.
Abstract
While attention-based architectures, such as Conformers, excel in speech enhancement, they face challenges such as scalability with respect to input sequence length. In contrast, the recently proposed Extended Long Short-Term Memory (xLSTM) architecture offers linear scalability. However, xLSTM-based models remain unexplored for speech enhancement. This paper introduces xLSTM-SENet, the first xLSTM-based single-channel speech enhancement system. A comparative analysis reveals that xLSTM-and notably, even LSTM-can match or outperform state-of-the-art Mamba- and Conformer-based systems across various model sizes in speech enhancement on the VoiceBank+Demand dataset. Through ablation studies, we identify key architectural design choices such as exponential gating and bidirectionality contributing to its effectiveness. Our best xLSTM-based model, xLSTM-SENet2, outperforms state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Infant Health and Development
