Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws
Lin Guo, Xiaoqing Luo, Wei Xie, Zhancheng Zhang, Hui Li, Rui Wang, Zhenhua Feng, Xiaoning Song

TL;DR
This paper introduces HCLFuse, a novel infrared and visible image fusion method inspired by human cognition, combining information decomposition and diffusion models to enhance structural detail and fusion reliability.
Contribution
HCLFuse uniquely integrates information mapping, a multi-scale variational encoder, and a diffusion-based generative model guided by physical laws for improved image fusion.
Findings
Achieves state-of-the-art fusion performance across multiple datasets.
Significantly improves semantic segmentation metrics.
Enhances structural consistency and detail quality in fused images.
Abstract
Existing infrared and visible image fusion methods often face the dilemma of balancing modal information. Generative fusion methods reconstruct fused images by learning from data distributions, but their generative capabilities remain limited. Moreover, the lack of interpretability in modal information selection further affects the reliability and consistency of fusion results in complex scenarios. This manuscript revisits the essence of generative image fusion under the inspiration of human cognitive laws and proposes a novel infrared and visible image fusion method, termed HCLFuse. First, HCLFuse investigates the quantification theory of information mapping in unsupervised fusion networks, which leads to the design of a multi-scale mask-regulated variational bottleneck encoder. This encoder applies posterior probability modeling and information decomposition to extract accurate and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
