Towards RGB-NIR Cross-modality Image Registration and Beyond
Huadong Li, Shichao Dong, Jin Wang, Rong Fu, Minhao Jing, Jiajun, Liang, Haoqiang Fan, Renhe Ji

TL;DR
This paper introduces a new benchmark for RGB-NIR image registration, analyzes challenges caused by appearance discrepancies, and proposes a semantic guidance transformer to improve registration accuracy.
Contribution
The paper presents the RGB-NIR-IRegis benchmark, analyzes the impact of local feature inconsistency, and develops the SGFormer model utilizing semantic guidance for better registration.
Findings
RGB-NIR-IRegis enables comprehensive evaluation of registration methods.
Inconsistent local features significantly hinder registration performance.
SGFormer effectively mitigates the impact of local feature inconsistency.
Abstract
This paper focuses on the area of RGB(visible)-NIR(near-infrared) cross-modality image registration, which is crucial for many downstream vision tasks to fully leverage the complementary information present in visible and infrared images. In this field, researchers face two primary challenges - the absence of a correctly-annotated benchmark with viewpoint variations for evaluating RGB-NIR cross-modality registration methods and the problem of inconsistent local features caused by the appearance discrepancy between RGB-NIR cross-modality images. To address these challenges, we first present the RGB-NIR Image Registration (RGB-NIR-IRegis) benchmark, which, for the first time, enables fair and comprehensive evaluations for the task of RGB-NIR cross-modality image registration. Evaluations of previous methods highlight the significant challenges posed by our RGB-NIR-IRegis benchmark,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Industrial Vision Systems and Defect Detection · Image Retrieval and Classification Techniques
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections
