Tell2Reg: Establishing spatial correspondence between images by the same   language prompts

Wen Yan; Qianye Yang; Shiqi Huang; Yipei Wang; Shonit Punwani; Mark; Emberton; Vasilis Stavrinides; Yipeng Hu; Dean Barratt

arXiv:2502.03118·cs.CV·February 6, 2025

Tell2Reg: Establishing spatial correspondence between images by the same language prompts

Wen Yan, Qianye Yang, Shiqi Huang, Yipei Wang, Shonit Punwani, Mark, Emberton, Vasilis Stavrinides, Yipeng Hu, Dean Barratt

PDF

Open Access 1 Repo

TL;DR

Tell2Reg introduces a fully automated, training-free image registration method that leverages large multimodal models and language prompts to establish spatial correspondence, demonstrated on challenging prostate MRI registration tasks.

Contribution

The paper presents a novel registration approach using language prompts and pre-trained models, eliminating the need for training data and outperforming unsupervised methods.

Findings

01

Outperforms unsupervised registration methods

02

Comparable to weakly-supervised methods

03

Reveals potential link between language semantics and spatial correspondence

Abstract

Spatial correspondence can be represented by pairs of segmented regions, such that the image registration networks aim to segment corresponding regions rather than predicting displacement fields or transformation parameters. In this work, we show that such a corresponding region pair can be predicted by the same language prompt on two different images using the pre-trained large multimodal models based on GroundingDINO and SAM. This enables a fully automated and training-free registration algorithm, potentially generalisable to a wide range of image registration tasks. In this paper, we present experimental results using one of the challenging tasks, registering inter-subject prostate MR images, which involves both highly variable intensity and morphology between patients. Tell2Reg is training-free, eliminating the need for costly and time-consuming data curation and labelling that was…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yanwenci/tell2reg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Image Retrieval and Classification Techniques · Natural Language Processing Techniques

MethodsSegment Anything Model