Improving Generalization of Medical Image Registration Foundation Model
Jing Hu, Kaiwei Yu, Hongjiang Xian, Shu Hu, Xin Wang

TL;DR
This paper enhances the generalization and robustness of medical image registration foundation models by integrating Sharpness-Aware Minimization, leading to improved cross-dataset performance and stability across diverse clinical scenarios.
Contribution
It introduces the use of Sharpness-Aware Minimization in foundation models to improve their generalization and robustness in medical image registration tasks.
Findings
Significant improvement in cross-dataset registration accuracy.
Enhanced model stability across diverse data distributions.
Better handling of complex clinical scenarios.
Abstract
Deformable registration is a fundamental task in medical image processing, aiming to achieve precise alignment by establishing nonlinear correspondences between images. Traditional methods offer good adaptability and interpretability but are limited by computational efficiency. Although deep learning approaches have significantly improved registration speed and accuracy, they often lack flexibility and generalizability across different datasets and tasks. In recent years, foundation models have emerged as a promising direction, leveraging large and diverse datasets to learn universal features and transformation patterns for image registration, thus demonstrating strong cross-task transferability. However, these models still face challenges in generalization and robustness when encountering novel anatomical structures, varying imaging conditions, or unseen modalities. To address these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications · Medical Imaging and Analysis
MethodsSharpness-Aware Minimization · Segment Anything Model · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
