XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration

Ismail Can Yagmur; Hasan F. Ates; Bahadir K. Gunturk

arXiv:2411.07430 · cs.CV · March 3, 2026

TL;DR

XPoint is a self-supervised, modular framework for multispectral image registration that adapts quickly across modalities, outperforming existing methods in feature matching and registration tasks.

Contribution

Introduces XPoint, a flexible self-supervised architecture that enables rapid adaptation and fine-tuning for multispectral image registration across diverse modalities.

Findings

1. Outperforms or matches state-of-the-art methods on five datasets.

2. Adapts effectively to a range of spectral modalities.

3. Remains robust in feature matching under geometric constraints.

Abstract

Accurate multispectral image matching presents significant challenges due to non-linear intensity variations across spectral modalities, extreme viewpoint changes, and the scarcity of labeled datasets. Current state-of-the-art methods are typically specialized for a single spectral difference, such as visible-infrared, and struggle to adapt to other modalities because they rely on expensive supervision, such as depth maps or camera poses. To address the need for rapid adaptation across modalities, we introduce XPoint, a self-supervised, modular image-matching framework designed for adaptive training and fine-tuning on aligned multispectral datasets, allowing users to customize key components for their specific tasks. XPoint employs modularity and self-supervision to allow for the adjustment of elements such as the base detector, which generates pseudo-ground-truth keypoints…
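The abstract does not spell out how supervision is derived from aligned image pairs. One common approach in self-supervised matching is to warp one image of an aligned pair with a known homography, so that keypoint correspondences follow directly from the warp rather than from depth maps or camera poses. The sketch below illustrates that idea only; the function names, keypoints, and homography are hypothetical and are not taken from the XPoint repository.

```python
def apply_homography(H, points):
    """Map (x, y) points through a 3x3 homography given as nested lists."""
    warped = []
    for x, y in points:
        u = H[0][0] * x + H[0][1] * y + H[0][2]
        v = H[1][0] * x + H[1][1] * y + H[1][2]
        w = H[2][0] * x + H[2][1] * y + H[2][2]
        warped.append((u / w, v / w))  # back from homogeneous coordinates
    return warped


def correspondence_labels(keypoints, H, width, height):
    """Pair each keypoint with its warped location, keeping in-bounds pairs only."""
    pairs = []
    for src, dst in zip(keypoints, apply_homography(H, keypoints)):
        if 0 <= dst[0] < width and 0 <= dst[1] < height:
            pairs.append((src, dst))
    return pairs


# Hypothetical pseudo-ground-truth keypoints, standing in for a base detector's output.
keypoints = [(10.0, 20.0), (50.0, 60.0), (120.0, 5.0)]

# Assumed known warp: a pure translation by (+15, -10) pixels.
H = [[1.0, 0.0, 15.0],
     [0.0, 1.0, -10.0],
     [0.0, 0.0, 1.0]]

pairs = correspondence_labels(keypoints, H, width=128, height=128)
for src, dst in pairs:
    print(src, "->", dst)
```

Because the warp is known, the third keypoint can be discarded automatically once it leaves the image bounds, and the surviving pairs serve as matching labels without any manual annotation.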

Code & Models

Repositories

canyagmur/xpoint
PyTorch · Official

Taxonomy

Methods: Balanced Selection