DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion

Yuchen Guo; Ruoxiang Xu; Rongcheng Li; Weifeng Su

arXiv:2409.10080·cs.CV·May 27, 2025

DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion

Yuchen Guo, Ruoxiang Xu, Rongcheng Li, Weifeng Su

PDF

Open Access

TL;DR

DAE-Fuse introduces an adaptive autoencoder framework that produces sharp, natural multi-modality image fusion results, extending to video with temporal consistency, and outperforms existing methods on various benchmarks.

Contribution

It presents a novel two-phase discriminative autoencoder for improved image fusion, extending techniques to video with temporal consistency, and demonstrates superior performance on multiple datasets.

Findings

01

Achieves state-of-the-art fusion quality on benchmarks.

02

Extends image fusion to video with temporal consistency.

03

Demonstrates superior generalizability to medical imaging.

Abstract

In extreme scenarios such as nighttime or low-visibility environments, achieving reliable perception is critical for applications like autonomous driving, robotics, and surveillance. Multi-modality image fusion, particularly integrating infrared imaging, offers a robust solution by combining complementary information from different modalities to enhance scene understanding and decision-making. However, current methods face significant limitations: GAN-based approaches often produce blurry images that lack fine-grained details, while AE-based methods may introduce bias toward specific modalities, leading to unnatural fusion results. To address these challenges, we propose DAE-Fuse, a novel two-phase discriminative autoencoder framework that generates sharp and natural fused images. Furthermore, We pioneer the extension of image fusion techniques from static images to the video domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Fusion Techniques