HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation
Sucheng Ren, Xiaomeng Li

TL;DR
HResFormer is a hybrid residual transformer model that effectively combines 2D and 3D information for improved volumetric medical image segmentation, addressing limitations of existing methods in capturing intra-slice details and managing computational costs.
Contribution
The paper introduces HResFormer, a novel hybrid model with a local-global fusion module and residual learning, enhancing 3D medical image segmentation by integrating fine-grained and global features.
Findings
Outperforms prior methods on benchmark datasets
Effectively fuses intra-slice and inter-slice information
Improves 3D understanding of anatomy in segmentation tasks
Abstract
Vision Transformer shows great superiority in medical image segmentation due to the ability in learning long-range dependency. For medical image segmentation from 3D data, such as computed tomography (CT), existing methods can be broadly classified into 2D-based and 3D-based methods. One key limitation in 2D-based methods is that the intra-slice information is ignored, while the limitation in 3D-based methods is the high computation cost and memory consumption, resulting in a limited feature representation for inner-slice information. During the clinical examination, radiologists primarily use the axial plane and then routinely review both axial and coronal planes to form a 3D understanding of anatomy. Motivated by this fact, our key insight is to design a hybrid model which can first learn fine-grained inner-slice information and then generate a 3D understanding of anatomy by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications · Brain Tumor Detection and Classification
MethodsAttention Is All You Need · Linear Layer · Adam · Layer Normalization · Dropout · Position-Wise Feed-Forward Layer · Label Smoothing · Dense Connections · Byte Pair Encoding · Residual Connection
