GrFormer: A Novel Transformer on Grassmann Manifold for Infrared and Visible Image Fusion
Huan Kang, Hui Li, Xiao-Jun Wu, Tianyang Xu, Rui Wang, Chunyang Cheng, Josef Kittler

TL;DR
This paper introduces GrFormer, a transformer model on the Grassmann manifold that improves infrared and visible image fusion by capturing intrinsic topological structures and balancing semantic and detail information.
Contribution
The paper proposes a novel Grassmann manifold-based attention mechanism and a cross-modal fusion strategy for enhanced image fusion performance.
Findings
Outperforms state-of-the-art methods on multiple benchmarks.
Effectively decouples high-frequency details and low-frequency semantics.
Demonstrates superior qualitative and quantitative fusion results.
Abstract
In the field of image fusion, promising progress has been made by modeling data from different modalities as linear subspaces. However, in practice, the source images are often located in a non-Euclidean space, where the Euclidean methods usually cannot encapsulate the intrinsic topological structure. Typically, the inner product performed in the Euclidean space calculates the algebraic similarity rather than the semantic similarity, which results in undesired attention output and a decrease in fusion performance. While the balance of low-level details and high-level semantics should be considered in infrared and visible image fusion task. To address this issue, in this paper, we propose a novel attention mechanism based on Grassmann manifold for infrared and visible image fusion (GrFormer). Specifically, our method constructs a low-rank subspace mapping through projection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Infrared Target Detection Methodologies
