Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation?

Pallabi Dutta; Soham Bose; Swalpa Kumar Roy; Sushmita Mitra

arXiv:2406.16993 · eess.IV · August 8, 2025

TL;DR

This paper introduces U-VixLSTM, a hybrid CNN and Vision-xLSTM architecture for medical 3D image segmentation that achieves high accuracy with lower computational costs, outperforming current state-of-the-art models.

Contribution

The paper proposes a novel U-VixLSTM architecture combining CNNs with Vision-xLSTM blocks for efficient and reliable medical image segmentation.

Findings

01. U-VixLSTM outperforms state-of-the-art networks on multiple datasets.
02. The model achieves high segmentation accuracy with reduced computational costs.
03. Code is publicly available for reproducibility.

Abstract

The development of efficient segmentation strategies for medical images has evolved from an initial dependence on Convolutional Neural Networks (CNNs) to the current investigation of hybrid models that combine CNNs with Vision Transformers (ViTs). There is an increasing focus on creating architectures that are both high-performing and computationally efficient, capable of being deployed on remote systems with limited resources. Although transformers can capture global dependencies in the input space, they incur high computational and storage costs. This research investigates the integration of CNNs with Vision Extended Long Short-Term Memory (Vision-xLSTM) blocks by introducing the novel U-VixLSTM. The Vision-xLSTM blocks capture the temporal and global relationships within the patches extracted from the CNN feature maps. The convolutional…
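The abstract mentions patches extracted from CNN feature maps being passed to the Vision-xLSTM blocks. As a rough illustration only, the sketch below shows one common way a 3D feature map can be split into a sequence of flattened patches. The function name `patchify_3d`, the `(C, D, H, W)` layout, and the patch size are assumptions made for this example; the abstract does not state how U-VixLSTM actually forms its patches.

```python
import numpy as np


def patchify_3d(feature_map, patch):
    """Split a (C, D, H, W) feature map into flattened, non-overlapping patches.

    Hypothetical helper for illustration; not taken from the paper's code.
    Returns an array of shape (num_patches, C * patch**3).
    """
    c, d, h, w = feature_map.shape
    if d % patch or h % patch or w % patch:
        raise ValueError("spatial dims must be divisible by the patch size")
    gd, gh, gw = d // patch, h // patch, w // patch
    # Split each spatial axis into (grid index, offset inside the patch).
    x = feature_map.reshape(c, gd, patch, gh, patch, gw, patch)
    # Bring the grid axes to the front; keep channel and in-patch axes last.
    x = x.transpose(1, 3, 5, 0, 2, 4, 6)
    # One row per patch: the token sequence a sequence model would consume.
    return x.reshape(gd * gh * gw, c * patch ** 3)


# Toy input: 2 channels over a 4x4x4 volume (assumed sizes, not from the paper).
features = np.arange(2 * 4 * 4 * 4, dtype=np.float32).reshape(2, 4, 4, 4)
tokens = patchify_3d(features, patch=2)
print(tokens.shape)  # (8, 16)
```

With these toy sizes the volume yields 8 patches of 16 values each. In a real model each row would typically be linearly projected to an embedding before entering the sequence blocks, but that step is outside what the abstract describes.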

Taxonomy

Topics: Medical Image Segmentation Techniques · Advanced Neural Network Applications · Medical Imaging and Analysis
