LM-Net: A Light-weight and Multi-scale Network for Medical Image   Segmentation

Zhenkun Lu; Chaoyin She; Wei Wang; Qinghua Huang

arXiv:2501.03838·cs.CV·January 8, 2025

LM-Net: A Light-weight and Multi-scale Network for Medical Image Segmentation

Zhenkun Lu, Chaoyin She, Wei Wang, Qinghua Huang

PDF

Open Access 1 Repo

TL;DR

LM-Net is a lightweight, multi-scale neural network that combines CNNs and Vision Transformers to improve medical image segmentation by capturing local details and global context, achieving state-of-the-art results.

Contribution

The paper introduces LM-Net, a novel lightweight architecture that integrates multi-scale features with local and global transformers for enhanced segmentation accuracy.

Findings

01

Achieves state-of-the-art segmentation results on three datasets.

02

Requires only 4.66G FLOPs and 5.4M parameters.

03

Effectively combines local textures and global semantics.

Abstract

Current medical image segmentation approaches have limitations in deeply exploring multi-scale information and effectively combining local detail textures with global contextual semantic information. This results in over-segmentation, under-segmentation, and blurred segmentation boundaries. To tackle these challenges, we explore multi-scale feature representations from different perspectives, proposing a novel, lightweight, and multi-scale architecture (LM-Net) that integrates advantages of both Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to enhance segmentation accuracy. LM-Net employs a lightweight multi-branch module to capture multi-scale features at the same level. Furthermore, we introduce two modules to concurrently capture local detail textures and global semantics with multi-scale features at different levels: the Local Feature Transformer (LFT) and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Asunatan/LM-Net
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiomics and Machine Learning in Medical Imaging · Brain Tumor Detection and Classification · Medical Imaging and Analysis

MethodsAbsolute Position Encodings · Softmax · Linear Layer · Attention Is All You Need · Adam · Residual Connection · Dropout · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing