EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration
Abu Zahid Bin Aziz, Mokshagna Sai Teja Karanam, Tushar Kataria,, Shireen Y. Elhabian

TL;DR
EfficientMorph is a novel transformer-based architecture for 3D medical image registration that balances local and global attention, achieving high accuracy with significantly fewer parameters and improved efficiency.
Contribution
It introduces a plane-based attention mechanism and a Hi-Res tokenization strategy, setting new benchmarks with fewer parameters compared to existing models.
Findings
Sets a new benchmark on the OASIS dataset.
Uses 16-27x fewer parameters than previous models.
Balances local and global attention effectively.
Abstract
Transformers have emerged as the state-of-the-art architecture in medical image registration, outperforming convolutional neural networks (CNNs) by addressing their limited receptive fields and overcoming gradient instability in deeper models. Despite their success, transformer-based models require substantial resources for training, including data, memory, and computational power, which may restrict their applicability for end users with limited resources. In particular, existing transformer-based 3D image registration architectures face two critical gaps that challenge their efficiency and effectiveness. Firstly, although window-based attention mechanisms reduce the quadratic complexity of full attention by focusing on local regions, they often struggle to effectively integrate both local and global information. Secondly, the granularity of tokenization, a crucial factor in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques
MethodsAttention Is All You Need · OASIS · Softmax · Linear Layer · Multi-Head Attention
