CoMIR: Contrastive Multimodal Image Representation for Registration
Nicolas Pielawski, Elisabeth Wetzer, Johan \"Ofverstedt, Jiahao Lu,, Carolina W\"ahlby, Joakim Lindblad, Nata\v{s}a Sladoje

TL;DR
CoMIR introduces a contrastive learning method to generate shared image representations that facilitate multimodal image registration, outperforming existing approaches especially in challenging biomedical and remote sensing datasets.
Contribution
The paper presents a novel contrastive coding approach with a hyperparameter-free modification to learn rotationally equivariant shared representations for multimodal image registration.
Findings
CoMIR representations enable effective registration of multimodal images.
The method outperforms GAN-based translation and existing registration techniques.
Representations are stable across different initializations and hyperparameters.
Abstract
We propose contrastive coding to learn shared, dense image representations, referred to as CoMIRs (Contrastive Multimodal Image Representations). CoMIRs enable the registration of multimodal images where existing registration methods often fail due to a lack of sufficiently similar image structures. CoMIRs reduce the multimodal registration problem to a monomodal one, in which general intensity-based, as well as feature-based, registration algorithms can be applied. The method involves training one neural network per modality on aligned images, using a contrastive loss based on noise-contrastive estimation (InfoNCE). Unlike other contrastive coding methods, used for, e.g., classification, our approach generates image-like representations that contain the information shared between modalities. We introduce a novel, hyperparameter-free modification to InfoNCE, to enforce rotational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
MethodsInfoNCE
