RefineFormer3D: Efficient 3D Medical Image Segmentation via Adaptive Multi-Scale Transformer with Cross Attention Fusion
Kavyansh Tyagi, Vishwas Rathi, Puneet Goyal

TL;DR
RefineFormer3D is a lightweight, efficient transformer architecture for 3D medical image segmentation that balances high accuracy with low computational and memory demands, suitable for clinical deployment.
Contribution
It introduces a novel hierarchical transformer with GhostConv3D, MixFFN3D, and cross-attention fusion, significantly reducing parameters while maintaining state-of-the-art performance.
Findings
Achieves high Dice scores on ACDC and BraTS benchmarks.
Contains only 2.94 million parameters, much fewer than competitors.
Provides fast inference suitable for resource-limited clinical settings.
Abstract
Accurate and computationally efficient 3D medical image segmentation remains a critical challenge in clinical workflows. Transformer-based architectures often demonstrate superior global contextual modeling but at the expense of excessive parameter counts and memory demands, restricting their clinical deployment. We propose RefineFormer3D, a lightweight hierarchical transformer architecture that balances segmentation accuracy and computational efficiency for volumetric medical imaging. The architecture integrates three key components: (i) GhostConv3D-based patch embedding for efficient feature extraction with minimal redundancy, (ii) MixFFN3D module with low-rank projections and depthwise convolutions for parameter-efficient feature extraction, and (iii) a cross-attention fusion decoder enabling adaptive multi-scale skip connection integration. RefineFormer3D contains only 2.94M…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Imaging and Analysis · Medical Image Segmentation Techniques
