UNeXt: MLP-based Rapid Medical Image Segmentation Network
Jeya Maria Jose Valanarasu, Vishal M. Patel

TL;DR
UNeXt introduces a parameter-efficient MLP-based network for rapid and accurate medical image segmentation, significantly reducing complexity and increasing speed compared to traditional methods.
Contribution
The paper presents a novel MLP-based segmentation network with tokenized MLP blocks and channel shifting, achieving faster inference with fewer parameters and better accuracy.
Findings
Reduces parameters by 72x
Decreases computational complexity by 68x
Improves inference speed by 10x
Abstract
UNet and its latest extensions like TransUNet have been the leading medical image segmentation methods in recent years. However, these networks cannot be effectively adopted for rapid image segmentation in point-of-care applications as they are parameter-heavy, computationally complex and slow to use. To this end, we propose UNeXt which is a Convolutional multilayer perceptron (MLP) based network for image segmentation. We design UNeXt in an effective way with an early convolutional stage and a MLP stage in the latent stage. We propose a tokenized MLP block where we efficiently tokenize and project the convolutional features and use MLPs to model the representation. To further boost the performance, we propose shifting the channels of the inputs while feeding in to MLPs so as to focus on learning local dependencies. Using tokenized MLPs in latent space reduces the number of parameters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Radiomics and Machine Learning in Medical Imaging
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
