TL;DR
This paper introduces a novel dictionary-guided framework for infrared-visible image fusion that effectively handles missing IR data by operating in the coefficient domain, improving perceptual quality and detection performance.
Contribution
It proposes a shared convolutional dictionary and a coefficient-domain inference-fusion pipeline, a first approach to address missing-IR fusion with interpretability and prior preservation.
Findings
Consistent improvements in perceptual quality under missing-IR conditions.
Enhanced downstream detection performance with the proposed method.
First framework to jointly learn a shared dictionary and perform coefficient-domain inference for missing-IR fusion.
Abstract
Infrared-visible (IR-VIS) image fusion is vital for perception and security, yet most methods rely on the availability of both modalities during training and inference. When the infrared modality is absent, pixel-space generative substitutes become hard to control and inherently lack interpretability. We address missing-IR fusion by proposing a dictionary-guided, coefficient-domain framework built upon a shared convolutional dictionary. The pipeline comprises three key components: (1) Joint Shared-dictionary Representation Learning (JSRL) learns a unified and interpretable atom space shared by both IR and VIS modalities; (2) VIS-Guided IR Inference (VGII) transfers VIS coefficients to pseudo-IR coefficients in the coefficient domain and performs a one-step closed-loop refinement guided by a frozen large language model as a weak semantic prior; and (3) Adaptive Fusion via Representation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
