MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders   for Infrared and Visible Image Fusion via Guided Training

Jiayang Li; Junjun Jiang; Pengwei Liang; Jiayi Ma; Liqiang Nie

arXiv:2404.11016·cs.CV·February 11, 2025·1 cites

MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training

Jiayang Li, Junjun Jiang, Pengwei Liang, Jiayi Ma, Liqiang Nie

PDF

Open Access

TL;DR

MaeFuse leverages pretrained Masked Autoencoders to extract omni features for infrared and visible image fusion, using guided training to improve feature integration and preserve details across modalities.

Contribution

The paper introduces MaeFuse, a novel autoencoder model that utilizes pretrained MAE encoders and a guided training strategy for improved image fusion performance.

Findings

01

Achieves high-quality fusion results across public datasets.

02

Effectively preserves details from both infrared and visible images.

03

Outperforms existing methods in visual quality and task-specific applications.

Abstract

In this paper, we introduce MaeFuse, a novel autoencoder model designed for Infrared and Visible Image Fusion (IVIF). The existing approaches for image fusion often rely on training combined with downstream tasks to obtain highlevel visual information, which is effective in emphasizing target objects and delivering impressive results in visual quality and task-specific applications. Instead of being driven by downstream tasks, our model called MaeFuse utilizes a pretrained encoder from Masked Autoencoders (MAE), which facilities the omni features extraction for low-level reconstruction and high-level vision tasks, to obtain perception friendly features with a low cost. In order to eliminate the domain gap of different modal features and the block effect caused by the MAE encoder, we further develop a guided training strategy. This strategy is meticulously crafted to ensure that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Fusion Techniques · Image Enhancement Techniques · Image and Signal Denoising Methods

MethodsMasked autoencoder