MedDIFT: Multi-Scale Diffusion-Based Correspondence in 3D Medical Imaging
Xingyu Zhang, Anna Reithmeir, Fryderyk K\"ogl, Rickmer Braren, Julia A. Schnabel, Daniel M. Lang

TL;DR
MedDIFT introduces a training-free 3D medical image correspondence method using multi-scale features from a pretrained diffusion model, improving anatomical matching without task-specific training.
Contribution
It leverages diffusion model representations for 3D medical image correspondence, eliminating the need for training and enhancing global semantic matching.
Findings
Effective in identifying anatomical correspondence in lung CTs
Multi-level feature fusion improves matching accuracy
Diffusion noise enhances feature robustness
Abstract
Accurate spatial correspondence between medical images is essential for longitudinal analysis, lesion tracking, and image-guided interventions. Medical image registration methods rely on local intensity-based similarity measures, which fail to capture global semantic structure and often yield mismatches in low-contrast or anatomically variable regions. Recent advances in diffusion models suggest that their intermediate representations encode rich geometric and semantic information. We present MedDIFT, a training-free 3D correspondence framework that leverages multi-scale features from a pretrained latent medical diffusion model as voxel descriptors. MedDIFT fuses diffusion activations into rich voxel-wise descriptors and matches them via cosine similarity, with an optional local-search prior. On a publicly available lung CT dataset, MedDIFT shows promising capability in identifying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · MRI in cancer diagnosis · AI in cancer detection
