Graph Representation Learning for Infrared and Visible Image Fusion
Jing Li, Lu Bai, Bin Yang, Chang Li, Lingfei Ma, and Edwin R. Hancock

TL;DR
This paper introduces a novel graph convolutional network approach for infrared and visible image fusion, effectively capturing non-local self-similarity and inter-modal relationships to improve fused image quality.
Contribution
It proposes a graph-based method utilizing GCNs to extract intra- and inter-modal non-local self-similarity, addressing limitations of CNNs and transformers in image fusion.
Findings
Outperforms existing methods on three datasets
Effectively captures non-local self-similarity
Enhances fused image quality with detailed features
Abstract
Infrared and visible image fusion aims to extract complementary features to synthesize a single fused image. Many methods employ convolutional neural networks (CNNs) to extract local features due to its translation invariance and locality. However, CNNs fail to consider the image's non-local self-similarity (NLss), though it can expand the receptive field by pooling operations, it still inevitably leads to information loss. In addition, the transformer structure extracts long-range dependence by considering the correlativity among all image patches, leading to information redundancy of such transformer-based methods. However, graph representation is more flexible than grid (CNN) or sequence (transformer structure) representation to address irregular objects, and graph can also construct the relationships among the spatially repeatable details or texture with far-space distance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Remote-Sensing Image Classification · Visual Attention and Saliency Detection
