Bridging Human Evaluation to Infrared and Visible Image Fusion
Jinyuan Liu, Xingyuan Li, Qingyun Mei, Haoyuan Xu, Zhiying Jiang, Long Ma, Risheng Liu, Xin Fan

TL;DR
This paper introduces a human-centered reinforcement learning framework for infrared and visible image fusion, utilizing a large-scale human feedback dataset and a reward model to produce images that better match human aesthetic preferences.
Contribution
It presents the first large-scale human feedback dataset for IVIF and a reward-guided training method to improve perceptual quality of fused images.
Findings
Achieved state-of-the-art performance in human-aligned image fusion.
Developed a domain-specific reward function based on human feedback.
Enhanced fusion results to better match human aesthetic preferences.
Abstract
Infrared and visible image fusion (IVIF) integrates complementary modalities to enhance scene perception. Current methods predominantly focus on optimizing handcrafted losses and objective metrics, often resulting in fusion outcomes that do not align with human visual preferences. This challenge is further exacerbated by the ill-posed nature of IVIF, which severely limits its effectiveness in human perceptual environments such as security surveillance and driver assistance systems. To address these limitations, we propose a feedback reinforcement framework that bridges human evaluation to infrared and visible image fusion. To address the lack of human-centric evaluation metrics and data, we introduce the first large-scale human feedback dataset for IVIF, containing multidimensional subjective scores and artifact annotations, and enriched by a fine-tuned large language model with expert…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Image Enhancement Techniques · Visual Attention and Saliency Detection
