LM-Net: A Light-weight and Multi-scale Network for Medical Image Segmentation
Zhenkun Lu, Chaoyin She, Wei Wang, Qinghua Huang

TL;DR
LM-Net is a lightweight, multi-scale neural network that combines CNNs and Vision Transformers to improve medical image segmentation by capturing local details and global context, achieving state-of-the-art results.
Contribution
The paper introduces LM-Net, a novel lightweight architecture that integrates multi-scale features with local and global transformers for enhanced segmentation accuracy.
Findings
Achieves state-of-the-art segmentation results on three datasets.
Requires only 4.66G FLOPs and 5.4M parameters.
Effectively combines local textures and global semantics.
Abstract
Current medical image segmentation approaches have limitations in deeply exploring multi-scale information and effectively combining local detail textures with global contextual semantic information. This results in over-segmentation, under-segmentation, and blurred segmentation boundaries. To tackle these challenges, we explore multi-scale feature representations from different perspectives, proposing a novel, lightweight, and multi-scale architecture (LM-Net) that integrates advantages of both Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to enhance segmentation accuracy. LM-Net employs a lightweight multi-branch module to capture multi-scale features at the same level. Furthermore, we introduce two modules to concurrently capture local detail textures and global semantics with multi-scale features at different levels: the Local Feature Transformer (LFT) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Brain Tumor Detection and Classification · Medical Imaging and Analysis
MethodsAbsolute Position Encodings · Softmax · Linear Layer · Attention Is All You Need · Adam · Residual Connection · Dropout · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing
