Improving Generalization of Medical Image Registration Foundation Model

Jing Hu; Kaiwei Yu; Hongjiang Xian; Shu Hu; Xin Wang

arXiv:2505.06527 · cs.CV · May 13, 2025

TL;DR

This paper enhances the generalization and robustness of medical image registration foundation models by integrating Sharpness-Aware Minimization, leading to improved cross-dataset performance and stability across diverse clinical scenarios.

Contribution

The paper applies Sharpness-Aware Minimization (SAM) to medical image registration foundation models to improve their generalization and robustness.
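As a rough illustration of what Sharpness-Aware Minimization does, the sketch below runs the generic two-step SAM update on a toy quadratic loss in plain Python. It is not the paper's implementation: the loss, the `CURVATURES` values, the function names, and the `lr` and `rho` settings are all hypothetical stand-ins chosen for the example. Each step first perturbs the weights toward higher loss within an L2 ball of radius `rho`, then applies the gradient measured at that perturbed point to the original weights.

```python
import math

# Hypothetical toy objective (not a registration loss): a quadratic bowl
# with one sharp direction and one flat direction.
CURVATURES = [10.0, 0.1]


def loss(w):
    """Toy loss: 0.5 * sum(c_i * w_i^2)."""
    return 0.5 * sum(c * x * x for c, x in zip(CURVATURES, w))


def grad(w):
    """Analytic gradient of the toy loss."""
    return [c * x for c, x in zip(CURVATURES, w)]


def sam_step(w, lr=0.05, rho=0.05):
    """One generic SAM update with illustrative hyperparameters."""
    g = grad(w)
    norm = math.sqrt(sum(gi * gi for gi in g)) + 1e-12
    # Ascent step: approximate worst-case weights inside an L2 ball of radius rho.
    w_adv = [wi + rho * gi / norm for wi, gi in zip(w, g)]
    # Descent step: use the gradient at the perturbed point,
    # but apply it to the original weights.
    g_adv = grad(w_adv)
    return [wi - lr * gi for wi, gi in zip(w, g_adv)]


w = [1.0, 1.0]
initial_loss = loss(w)
for _ in range(100):
    w = sam_step(w)
final_loss = loss(w)
print(f"loss before: {initial_loss:.4f}, after: {final_loss:.4f}")
```

Setting `rho=0.0` in this sketch removes the perturbation, so `sam_step` reduces to an ordinary gradient descent step. In a real training loop the same pattern costs two forward-backward passes per batch.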

Findings

01. Significant improvement in cross-dataset registration accuracy.

02. Enhanced model stability across diverse data distributions.

03. Better handling of complex clinical scenarios.

Abstract

Deformable registration is a fundamental task in medical image processing, aiming to achieve precise alignment by establishing nonlinear correspondences between images. Traditional methods offer good adaptability and interpretability but are limited by computational efficiency. Although deep learning approaches have significantly improved registration speed and accuracy, they often lack flexibility and generalizability across different datasets and tasks. In recent years, foundation models have emerged as a promising direction, leveraging large and diverse datasets to learn universal features and transformation patterns for image registration, thus demonstrating strong cross-task transferability. However, these models still face challenges in generalization and robustness when encountering novel anatomical structures, varying imaging conditions, or unseen modalities. To address these…

Code & Models

Repositories

promise13/fm_sam (PyTorch, official)

Taxonomy

Topics: Medical Image Segmentation Techniques · Advanced Neural Network Applications · Medical Imaging and Analysis

Methods: Sharpness-Aware Minimization · Segment Anything Model · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings