LadleNet: A Two-Stage UNet for Infrared Image to Visible Image Translation Guided by Semantic Segmentation

Tonghui Zou; Lei Chen

arXiv:2308.06603·cs.CV·April 16, 2024·1 cites

TL;DR

LadleNet introduces a two-stage U-Net architecture guided by semantic segmentation for infrared-to-visible image translation, aiming at more realistic output and better generalization to unseen scenarios.

Contribution

This paper proposes LadleNet, a two-stage U-Net architecture guided by semantic segmentation, and LadleNet+, a variant with a pre-trained segmentation module, to improve translation quality and adaptability.

Findings

01. LadleNet outperforms existing methods with 12.4% higher SSIM.

02. LadleNet+ achieves 15.2% higher SSIM and 50.6% higher MS-SSIM.

03. Both models show significant improvements on the KAIST dataset.
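
To make the SSIM figures above easier to interpret, here is a minimal sketch of the standard SSIM formula computed over a whole image as a single window. This is an illustration, not the paper's evaluation code: published SSIM scores are normally averaged over local sliding windows, and the function name `global_ssim`, the flat-list input format, and the 8-bit `data_range` default are assumptions made for this example.

```python
from statistics import fmean


def global_ssim(x, y, data_range=255.0):
    """Single-window SSIM for two equal-length flat lists of pixel intensities.

    Simplification (assumption): the statistics are taken over the whole
    image at once instead of being averaged over local sliding windows.
    """
    if len(x) != len(y) or not x:
        raise ValueError("inputs must be non-empty and of equal length")

    # Stabilizing constants from the standard SSIM definition.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2

    mu_x, mu_y = fmean(x), fmean(y)
    var_x = fmean([(a - mu_x) ** 2 for a in x])
    var_y = fmean([(b - mu_y) ** 2 for b in y])
    cov = fmean([(a - mu_x) * (b - mu_y) for a, b in zip(x, y)])

    luminance_contrast = (2 * mu_x * mu_y + c1) / (mu_x ** 2 + mu_y ** 2 + c1)
    structure = (2 * cov + c2) / (var_x + var_y + c2)
    return luminance_contrast * structure


# Hypothetical 2x2 "images", flattened row by row.
reference = [0.0, 64.0, 128.0, 255.0]
translated = [10.0, 60.0, 120.0, 240.0]
score = global_ssim(reference, translated)
```

Identical inputs score 1 (up to floating-point rounding), and the score drops as mean brightness, contrast, or structure diverge, so a higher SSIM means the translated image is closer to the visible-light reference.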

Abstract

The translation of thermal infrared (TIR) images into visible light (VI) images plays a critical role in enhancing model performance and generalization capability, particularly in various fields such as registration and fusion of TIR and VI images. However, current research in this field faces challenges of insufficiently realistic image quality after translation and the difficulty of existing models in adapting to unseen scenarios. In order to develop a more generalizable image translation architecture, we conducted an analysis of existing translation architectures. By exploring the interpretability of intermediate modalities in existing translation architectures, we found that the intermediate modality in the image translation process for street scene images essentially performs semantic segmentation, distinguishing street images based on background and foreground patterns before…

Code & Models

Repositories

ach-1914/ladlenet (Official)

Taxonomy

Topics: Advanced Image Fusion Techniques · Advanced Image Processing Techniques · Image Enhancement Techniques

Methods: Concatenated Skip Connection · Convolution · Max Pooling · U-Net