xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart
Tianrun Chen, Chaotao Ding, Lanyun Zhu, Tao Xu, Deyi Ji, Yan Wang,, Ying Zang, Zejian Li

TL;DR
This paper introduces xLSTM-UNet, a novel deep learning architecture combining Vision-LSTM with UNet, which outperforms existing CNN, Transformer, and Mamba-based models in 2D and 3D biomedical image segmentation tasks.
Contribution
The paper presents xLSTM-UNet, a new segmentation backbone leveraging Vision-LSTM, demonstrating superior performance over existing models in biomedical image segmentation.
Findings
xLSTM-UNet outperforms CNN, Transformer, and Mamba-based models in multiple datasets.
The architecture effectively captures long-range dependencies in biomedical images.
Results are validated across 2D and 3D segmentation tasks.
Abstract
Convolutional Neural Networks (CNNs) and Vision Transformers (ViT) have been pivotal in biomedical image segmentation, yet their ability to manage long-range dependencies remains constrained by inherent locality and computational overhead. To overcome these challenges, in this technical report, we first propose xLSTM-UNet, a UNet structured deep learning neural network that leverages Vision-LSTM (xLSTM) as its backbone for medical image segmentation. xLSTM is a recently proposed as the successor of Long Short-Term Memory (LSTM) networks and have demonstrated superior performance compared to Transformers and State Space Models (SSMs) like Mamba in Neural Language Processing (NLP) and image classification (as demonstrated in Vision-LSTM, or ViL implementation). Here, xLSTM-UNet we designed extend the success in biomedical image segmentation domain. By integrating the local feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification
