Visible and Infrared Image Fusion Using Encoder-Decoder Network

Ferhat Can Ataman; Gözde Bozdaği Akar

arXiv:2412.08073·cs.CV·December 12, 2024

TL;DR

This paper introduces a novel convolutional encoder-decoder network for infrared and visible image fusion, achieving superior quality and real-time performance on embedded devices through a learning-based approach.

Contribution

It presents a new learning-based image fusion method that uses only convolution and pooling layers and is trained with a loss function built on no-reference quality metrics. The reported results outperform existing methods.

Findings

1. Better fusion quality than state-of-the-art methods.
2. Real-time performance on embedded devices.
3. Qualitative and quantitative analysis confirms effectiveness.

Abstract

The aim of multispectral image fusion is to combine object or scene features from images with different spectral characteristics in order to increase perceptual quality. In this paper, we present a novel learning-based solution to the image fusion problem, focusing on infrared and visible spectrum images. The proposed solution uses only convolution and pooling layers, together with a loss function based on no-reference quality metrics. The analysis is performed qualitatively and quantitatively on various datasets. The results show better performance than state-of-the-art methods. In addition, the small size of our network enables real-time performance on embedded devices. The project code is available at https://github.com/ferhatcan/pyFusionSR.
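To make the idea of a no-reference quality metric concrete, the sketch below scores a fused image without any ground-truth target. The abstract does not name the metrics the authors use, so the two measures here (grey-level entropy and spatial frequency) are assumptions chosen because they are widely used in fusion evaluation. The function names, the pixel-wise maximum fusion rule, and the weights are likewise hypothetical and are not taken from the paper or its repository. A loss used to train a network would need differentiable versions of such measures; this plain-Python version only illustrates the scoring.

```python
import math
from collections import Counter


def entropy(image):
    """Shannon entropy (in bits) of the grey-level histogram of a 2-D image."""
    pixels = [p for row in image for p in row]
    total = len(pixels)
    counts = Counter(pixels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def spatial_frequency(image):
    """Spatial frequency: sqrt(RF^2 + CF^2) from first differences along rows and columns."""
    h, w = len(image), len(image[0])
    row_sq = sum((image[i][j] - image[i][j - 1]) ** 2
                 for i in range(h) for j in range(1, w))
    col_sq = sum((image[i][j] - image[i - 1][j]) ** 2
                 for i in range(1, h) for j in range(w))
    return math.sqrt((row_sq + col_sq) / (h * w))


def fuse_max(visible, infrared):
    """Hypothetical baseline fusion rule: pixel-wise maximum of the two inputs."""
    return [[max(v, r) for v, r in zip(v_row, r_row)]
            for v_row, r_row in zip(visible, infrared)]


def no_reference_loss(fused, w_entropy=1.0, w_sf=1.0):
    """Illustrative loss: lower is better, so the quality scores are negated.

    The weights are assumptions, not values from the paper.
    """
    return -(w_entropy * entropy(fused) + w_sf * spatial_frequency(fused))
```

For example, `no_reference_loss(fuse_max(visible, infrared))` scores a fused result using only the fused image itself, which is what allows this kind of objective to be used when no ground-truth fused image exists.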

Code & Models

Repositories

ferhatcan/pyfusionsr
none · Official

Taxonomy

Methods: Convolution