Breaking Free from Fusion Rule: A Fully Semantic-driven Infrared and Visible Image Fusion
Yuhui Wu, Zhu Liu, Jinyuan Liu, Xin Fan, Risheng Liu

TL;DR
This paper introduces a semantic-level infrared and visible image fusion network that eliminates the need for handcrafted fusion rules, utilizing transformer-based multi-scale fusion blocks and semantic guidance to improve visual quality and high-level vision task performance.
Contribution
It proposes a fully semantic-driven fusion network with transformer-based multi-scale fusion blocks and a novel training strategy, removing reliance on experimental fusion rules.
Findings
Outperforms state-of-the-art methods in visual quality.
Enhances high-level vision task performance.
Avoids handcrafted fusion loss functions.
Abstract
Infrared and visible image fusion plays a vital role in the field of computer vision. Previous approaches make efforts to design various fusion rules in the loss functions. However, these experimental designed fusion rules make the methods more and more complex. Besides, most of them only focus on boosting the visual effects, thus showing unsatisfactory performance for the follow-up high-level vision tasks. To address these challenges, in this letter, we develop a semantic-level fusion network to sufficiently utilize the semantic guidance, emancipating the experimental designed fusion rules. In addition, to achieve a better semantic understanding of the feature fusion process, a fusion block based on the transformer is presented in a multi-scale manner. Moreover, we devise a regularization loss function, together with a training strategy, to fully use semantic guidance from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Visual Attention and Saliency Detection · Photoacoustic and Ultrasonic Imaging
