CoFusion: Multispectral and Hyperspectral Image Fusion via Spectral Coordinate Attention

Baisong Li

arXiv:2604.10584·cs.CV·April 15, 2026

CoFusion: Multispectral and Hyperspectral Image Fusion via Spectral Coordinate Attention

Baisong Li

PDF

TL;DR

CoFusion is a novel framework for multispectral and hyperspectral image fusion that models cross-scale and cross-modal dependencies to improve spatial detail and spectral fidelity.

Contribution

It introduces a unified spatial-spectral collaborative fusion framework with multi-scale architecture and specialized modules for enhanced image reconstruction.

Findings

01

Outperforms state-of-the-art methods on benchmark datasets.

02

Achieves better spatial detail and spectral fidelity.

03

Demonstrates robustness across multiple datasets.

Abstract

Multispectral and Hyperspectral Image Fusion (MHIF) aims to reconstruct high-resolution images by integrating low-resolution hyperspectral images (LRHSI) and high-resolution multispectral images (HRMSI). However, existing methods face limitations in modeling cross-scale interactions and spatial-spectral collaboration, making it difficult to achieve an optimal trade-off between spatial detail enhancement and spectral fidelity. To address this challenge, we propose CoFusion: a unified spatial-spectral collaborative fusion framework that explicitly models cross-scale and cross-modal dependencies. Specifically, a Multi-Scale Generator (MSG) is designed to construct a three-level pyramidal architecture, enabling the effective integration of global semantics and local details. Within each scale, a dual-branch strategy is employed: the Spatial Coordinate-Aware Mixing module (SpaCAM) is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.