Cross-Modal Image Fusion Theory Guided by Subjective Visual Attention
Aiqing Fang, Xinbo Zhao, Yanning Zhang

TL;DR
This paper introduces a novel image fusion approach guided by subjective visual attention, inspired by human perception, to enhance robustness and contextual awareness in multi-modal image processing tasks.
Contribution
It proposes a multi-task auxiliary learning framework combined with subjective attention modeling, unifying human-like perception with image fusion techniques.
Findings
Outperforms state-of-the-art methods in robustness and contextual awareness
Effective fusion of infrared and visible images
Validated on combined vision and infrared-visible datasets
Abstract
The human visual perception system has very strong robustness and contextual awareness in a variety of image processing tasks. This robustness and the perception ability of contextual awareness is closely related to the characteristics of multi-task auxiliary learning and subjective attention of the human visual perception system. In order to improve the robustness and contextual awareness of image fusion tasks, we proposed a multi-task auxiliary learning image fusion theory guided by subjective attention. The image fusion theory effectively unifies the subjective task intention and prior knowledge of human brain. In order to achieve our proposed image fusion theory, we first analyze the mechanism of multi-task auxiliary learning, build a multi-task auxiliary learning network. Secondly, based on the human visual attention perception mechanism, we introduce the human visual attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Visual Attention and Saliency Detection · Infrared Target Detection Methodologies
