LadleNet: A Two-Stage UNet for Infrared Image to Visible Image Translation Guided by Semantic Segmentation
Tonghui Zou, Lei Chen

TL;DR
LadleNet introduces a two-stage U-net architecture guided by semantic segmentation to improve infrared to visible image translation, enhancing realism and generalization, especially in unseen scenarios.
Contribution
This paper proposes LadleNet, a novel two-stage U-net architecture with a semantic segmentation-guided approach, and LadleNet+ with a pre-trained segmentation module, improving translation quality and adaptability.
Findings
LadleNet outperforms existing methods with 12.4% higher SSIM.
LadleNet+ achieves 15.2% higher SSIM and 50.6% higher MS-SSIM.
Both models show significant improvements on the KAIST dataset.
Abstract
The translation of thermal infrared (TIR) images into visible light (VI) images plays a critical role in enhancing model performance and generalization capability, particularly in various fields such as registration and fusion of TIR and VI images. However, current research in this field faces challenges of insufficiently realistic image quality after translation and the difficulty of existing models in adapting to unseen scenarios. In order to develop a more generalizable image translation architecture, we conducted an analysis of existing translation architectures. By exploring the interpretability of intermediate modalities in existing translation architectures, we found that the intermediate modality in the image translation process for street scene images essentially performs semantic segmentation, distinguishing street images based on background and foreground patterns before…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Advanced Image Processing Techniques · Image Enhancement Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Convolution · Max Pooling · U-Net
