TL;DR
MagicFuse introduces a novel single-image fusion framework that enhances visual and semantic scene understanding from a single low-quality visible image using diffusion models and knowledge reinforcement.
Contribution
It extends traditional multi-modal fusion to a knowledge-based approach capable of deriving cross-spectral scene representations from a single image.
Findings
Achieves comparable or better performance than multi-modal methods.
Effectively mines scene information obscured in visible spectrum.
Supports downstream semantic decision-making.
Abstract
This paper focuses on a highly practical scenario: how to continue benefiting from the advantages of multi-modal image fusion under harsh conditions when only visible imaging sensors are available. To achieve this goal, we propose a novel concept of single-image fusion, which extends conventional data-level fusion to the knowledge level. Specifically, we develop MagicFuse, a novel single image fusion framework capable of deriving a comprehensive cross-spectral scene representation from a single low-quality visible image. MagicFuse first introduces an intra-spectral knowledge reinforcement branch and a cross-spectral knowledge generation branch based on the diffusion models. They mine scene information obscured in the visible spectrum and learn thermal radiation distribution patterns transferred to the infrared spectrum, respectively. Building on them, we design a multi-domain knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Image Enhancement Techniques · Remote-Sensing Image Classification
