Bridging Human Evaluation to Infrared and Visible Image Fusion

Jinyuan Liu; Xingyuan Li; Qingyun Mei; Haoyuan Xu; Zhiying Jiang; Long Ma; Risheng Liu; Xin Fan

arXiv:2603.03871·cs.CV·March 5, 2026

Bridging Human Evaluation to Infrared and Visible Image Fusion

Jinyuan Liu, Xingyuan Li, Qingyun Mei, Haoyuan Xu, Zhiying Jiang, Long Ma, Risheng Liu, Xin Fan

PDF

Open Access

TL;DR

This paper introduces a human-centered reinforcement learning framework for infrared and visible image fusion, utilizing a large-scale human feedback dataset and a reward model to produce images that better match human aesthetic preferences.

Contribution

It presents the first large-scale human feedback dataset for IVIF and a reward-guided training method to improve perceptual quality of fused images.

Findings

01

Achieved state-of-the-art performance in human-aligned image fusion.

02

Developed a domain-specific reward function based on human feedback.

03

Enhanced fusion results to better match human aesthetic preferences.

Abstract

Infrared and visible image fusion (IVIF) integrates complementary modalities to enhance scene perception. Current methods predominantly focus on optimizing handcrafted losses and objective metrics, often resulting in fusion outcomes that do not align with human visual preferences. This challenge is further exacerbated by the ill-posed nature of IVIF, which severely limits its effectiveness in human perceptual environments such as security surveillance and driver assistance systems. To address these limitations, we propose a feedback reinforcement framework that bridges human evaluation to infrared and visible image fusion. To address the lack of human-centric evaluation metrics and data, we introduce the first large-scale human feedback dataset for IVIF, containing multidimensional subjective scores and artifact annotations, and enriched by a fine-tuned large language model with expert…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Fusion Techniques · Image Enhancement Techniques · Visual Attention and Saliency Detection