Breaking Modality Disparity: Harmonized Representation for Infrared and Visible Image Registration
Zhiying Jiang, Zengxi Zhang, Jinyuan Liu, Xin Fan, Risheng Liu

TL;DR
This paper introduces a novel scene-adaptive registration method for infrared and visible images that uses an invertible translation to create a modality-invariant domain, improving accuracy and reducing manual calibration efforts.
Contribution
It proposes a new invertible translation process for modality-invariant representation and a hierarchical framework for deformation rectification in infrared-visible image registration.
Findings
Outperforms state-of-the-art registration methods in experiments.
Introduces the first ground truth dataset for infrared-visible image alignment.
Demonstrates robustness in both synthetic and real-world scenarios.
Abstract
Since the differences in viewing range, resolution and relative position, the multi-modality sensing module composed of infrared and visible cameras needs to be registered so as to have more accurate scene perception. In practice, manual calibration-based registration is the most widely used process, and it is regularly calibrated to maintain accuracy, which is time-consuming and labor-intensive. To cope with these problems, we propose a scene-adaptive infrared and visible image registration. Specifically, in regard of the discrepancy between multi-modality images, an invertible translation process is developed to establish a modality-invariant domain, which comprehensively embraces the feature intensity and distribution of both infrared and visible modalities. We employ homography to simulate the deformation between different planes and develop a hierarchical framework to rectify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Vision and Imaging · Advanced Neural Network Applications
