MagicFuse: Single Image Fusion for Visual and Semantic Reinforcement

Hao Zhang; Yanping Zha; Zizhuo Li; Meiqi Gong; Jiayi Ma

arXiv:2602.01760 · cs.CV · May 22, 2026

TL;DR

MagicFuse introduces a novel single-image fusion framework that enhances visual and semantic scene understanding from a single low-quality visible image using diffusion models and knowledge reinforcement.

Contribution

MagicFuse extends conventional data-level multi-modal fusion to the knowledge level, deriving a cross-spectral scene representation from a single low-quality visible image.

Findings

01. Achieves performance comparable to or better than multi-modal methods.
02. Effectively mines scene information obscured in the visible spectrum.
03. Supports downstream semantic decision-making.

Abstract

This paper focuses on a highly practical scenario: how to continue benefiting from the advantages of multi-modal image fusion under harsh conditions when only visible imaging sensors are available. To achieve this goal, we propose a novel concept of single-image fusion, which extends conventional data-level fusion to the knowledge level. Specifically, we develop MagicFuse, a novel single-image fusion framework capable of deriving a comprehensive cross-spectral scene representation from a single low-quality visible image. MagicFuse first introduces an intra-spectral knowledge reinforcement branch and a cross-spectral knowledge generation branch, both based on diffusion models. They mine scene information obscured in the visible spectrum and learn thermal radiation distribution patterns transferred to the infrared spectrum, respectively. Building on them, we design a multi-domain knowledge…

Code & Models

Repositories

zhayanping/MagicFuse (GitHub)

Taxonomy

Topics: Advanced Image Fusion Techniques · Image Enhancement Techniques · Remote-Sensing Image Classification