Breaking Modality Disparity: Harmonized Representation for Infrared and   Visible Image Registration

Zhiying Jiang; Zengxi Zhang; Jinyuan Liu; Xin Fan; Risheng Liu

arXiv:2304.05646·cs.CV·November 28, 2023·1 cites

Breaking Modality Disparity: Harmonized Representation for Infrared and Visible Image Registration

Zhiying Jiang, Zengxi Zhang, Jinyuan Liu, Xin Fan, Risheng Liu

PDF

Open Access

TL;DR

This paper introduces a novel scene-adaptive registration method for infrared and visible images that uses an invertible translation to create a modality-invariant domain, improving accuracy and reducing manual calibration efforts.

Contribution

It proposes a new invertible translation process for modality-invariant representation and a hierarchical framework for deformation rectification in infrared-visible image registration.

Findings

01

Outperforms state-of-the-art registration methods in experiments.

02

Introduces the first ground truth dataset for infrared-visible image alignment.

03

Demonstrates robustness in both synthetic and real-world scenarios.

Abstract

Since the differences in viewing range, resolution and relative position, the multi-modality sensing module composed of infrared and visible cameras needs to be registered so as to have more accurate scene perception. In practice, manual calibration-based registration is the most widely used process, and it is regularly calibrated to maintain accuracy, which is time-consuming and labor-intensive. To cope with these problems, we propose a scene-adaptive infrared and visible image registration. Specifically, in regard of the discrepancy between multi-modality images, an invertible translation process is developed to establish a modality-invariant domain, which comprehensively embraces the feature intensity and distribution of both infrared and visible modalities. We employ homography to simulate the deformation between different planes and develop a hierarchical framework to rectify the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Vision and Imaging · Advanced Neural Network Applications