GrFormer: A Novel Transformer on Grassmann Manifold for Infrared and Visible Image Fusion

Huan Kang; Hui Li; Xiao-Jun Wu; Tianyang Xu; Rui Wang; Chunyang Cheng; Josef Kittler

arXiv:2506.14384·cs.CV·June 18, 2025

GrFormer: A Novel Transformer on Grassmann Manifold for Infrared and Visible Image Fusion

Huan Kang, Hui Li, Xiao-Jun Wu, Tianyang Xu, Rui Wang, Chunyang Cheng, Josef Kittler

PDF

Open Access

TL;DR

This paper introduces GrFormer, a transformer model on the Grassmann manifold that improves infrared and visible image fusion by capturing intrinsic topological structures and balancing semantic and detail information.

Contribution

The paper proposes a novel Grassmann manifold-based attention mechanism and a cross-modal fusion strategy for enhanced image fusion performance.

Findings

01

Outperforms state-of-the-art methods on multiple benchmarks.

02

Effectively decouples high-frequency details and low-frequency semantics.

03

Demonstrates superior qualitative and quantitative fusion results.

Abstract

In the field of image fusion, promising progress has been made by modeling data from different modalities as linear subspaces. However, in practice, the source images are often located in a non-Euclidean space, where the Euclidean methods usually cannot encapsulate the intrinsic topological structure. Typically, the inner product performed in the Euclidean space calculates the algebraic similarity rather than the semantic similarity, which results in undesired attention output and a decrease in fusion performance. While the balance of low-level details and high-level semantics should be considered in infrared and visible image fusion task. To address this issue, in this paper, we propose a novel attention mechanism based on Grassmann manifold for infrared and visible image fusion (GrFormer). Specifically, our method constructs a low-rank subspace mapping through projection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Fusion Techniques · Infrared Target Detection Methodologies