Exploiting Scale-Variant Attention for Segmenting Small Medical Objects

Wei Dai; Rui Liu; Zixuan Wu; Tianyi Wu; Min Wang; Junxian Zhou; Yixuan Yuan; Jun Liu

arXiv:2407.07720·eess.IV·January 15, 2026

Exploiting Scale-Variant Attention for Segmenting Small Medical Objects

Wei Dai, Rui Liu, Zixuan Wu, Tianyi Wu, Min Wang, Junxian Zhou, Yixuan Yuan, Jun Liu

PDF

1 Repo

TL;DR

This paper introduces SvANet, a novel attention-based network that effectively segments small medical objects by leveraging cross-scale features and vision transformers, significantly improving accuracy in challenging medical imaging tasks.

Contribution

The paper proposes a scale-variant attention network that combines cross-scale guidance, Monte Carlo attention, and vision transformers to enhance small object segmentation in medical images, addressing CNN limitations.

Findings

01

Achieved high mean Dice scores across multiple datasets for small medical objects.

02

Outperformed existing methods in segmenting objects occupying less than 1% of images.

03

Demonstrated robustness and superior accuracy in diverse medical imaging scenarios.

Abstract

Early detection and accurate diagnosis can predict the risk of malignant disease transformation, thereby increasing the probability of effective treatment. Identifying mild syndrome with small pathological regions serves as an ominous warning and is fundamental in the early diagnosis of diseases. While deep learning algorithms, particularly convolutional neural networks (CNNs), have shown promise in segmenting medical objects, analyzing small areas in medical images remains challenging. This difficulty arises due to information losses and compression defects from convolution and pooling operations in CNNs, which become more pronounced as the network deepens, especially for small medical objects. To address these challenges, we propose a novel scale-variant attention-based network (SvANet) for accurately segmenting small-scale objects in medical images. The SvANet consists of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anthonyweidai/SvANet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution