Missing No More: Dictionary-Guided Cross-Modal Image Fusion under Missing Infrared

Yafei Zhang; Meng Ma; Huafeng Li; Yu Liu

arXiv:2603.08018·cs.CV·April 2, 2026

Missing No More: Dictionary-Guided Cross-Modal Image Fusion under Missing Infrared

Yafei Zhang, Meng Ma, Huafeng Li, Yu Liu

PDF

1 Repo

TL;DR

This paper introduces a novel dictionary-guided framework for infrared-visible image fusion that effectively handles missing IR data by operating in the coefficient domain, improving perceptual quality and detection performance.

Contribution

It proposes a shared convolutional dictionary and a coefficient-domain inference-fusion pipeline, a first approach to address missing-IR fusion with interpretability and prior preservation.

Findings

01

Consistent improvements in perceptual quality under missing-IR conditions.

02

Enhanced downstream detection performance with the proposed method.

03

First framework to jointly learn a shared dictionary and perform coefficient-domain inference for missing-IR fusion.

Abstract

Infrared-visible (IR-VIS) image fusion is vital for perception and security, yet most methods rely on the availability of both modalities during training and inference. When the infrared modality is absent, pixel-space generative substitutes become hard to control and inherently lack interpretability. We address missing-IR fusion by proposing a dictionary-guided, coefficient-domain framework built upon a shared convolutional dictionary. The pipeline comprises three key components: (1) Joint Shared-dictionary Representation Learning (JSRL) learns a unified and interpretable atom space shared by both IR and VIS modalities; (2) VIS-Guided IR Inference (VGII) transfers VIS coefficients to pseudo-IR coefficients in the coefficient domain and performs a one-step closed-loop refinement guided by a frozen large language model as a weak semantic prior; and (3) Adaptive Fusion via Representation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

harukiv/DCMIF
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.