MISFIT-V: Misaligned Image Synthesis and Fusion using Information from   Thermal and Visual

Aadhar Chauhan; Isaac Remy; Danny Broyles; and Karen Leung

arXiv:2309.13216·cs.CV·September 26, 2023

MISFIT-V: Misaligned Image Synthesis and Fusion using Information from Thermal and Visual

Aadhar Chauhan, Isaac Remy, Danny Broyles, and Karen Leung

PDF

Open Access 1 Repo

TL;DR

MISFIT-V introduces an unsupervised deep learning method combining GANs and cross-attention to improve thermal and visual image fusion for better human detection in challenging environments.

Contribution

It presents a novel unsupervised fusion approach using GANs and cross-attention to handle misalignment and environmental challenges in thermal-visual image fusion.

Findings

01

Enhanced robustness against misalignment

02

Improved performance in poor lighting and thermal conditions

03

Outperforms existing fusion methods

Abstract

Detecting humans from airborne visual and thermal imagery is a fundamental challenge for Wilderness Search-and-Rescue (WiSAR) teams, who must perform this function accurately in the face of immense pressure. The ability to fuse these two sensor modalities can potentially reduce the cognitive load on human operators and/or improve the effectiveness of computer vision object detection models. However, the fusion task is particularly challenging in the context of WiSAR due to hardware limitations and extreme environmental factors. This work presents Misaligned Image Synthesis and Fusion using Information from Thermal and Visual (MISFIT-V), a novel two-pronged unsupervised deep learning approach that utilizes a Generative Adversarial Network (GAN) and a cross-attention mechanism to capture the most relevant features from each modality. Experimental results show MISFIT-V offers enhanced…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Aadharc/Visual_Thermal_Image_Fusion
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInfrared Target Detection Methodologies · Advanced Image Fusion Techniques · Visual Attention and Saliency Detection