Image-Conditional Diffusion Transformer for Underwater Image Enhancement
Xingyang Nie, Su Pan, Xiaoyu Zhai, Shifei Tao, Fengzhong Qu, Biao, Wang, Huilin Ge, and Guojie Xiao

TL;DR
This paper introduces a novel underwater image enhancement method using an image-conditional diffusion transformer that outperforms existing techniques in quality and scalability.
Contribution
It proposes replacing the U-Net backbone with a transformer in a diffusion model for improved underwater image enhancement.
Findings
ICDT-XL/2 achieves state-of-the-art enhancement quality.
The hybrid loss accelerates the sampling process.
The method demonstrates excellent scalability on the Underwater ImageNet dataset.
Abstract
Underwater image enhancement (UIE) has attracted much attention owing to its importance for underwater operation and marine engineering. Motivated by the recent advance in generative models, we propose a novel UIE method based on image-conditional diffusion transformer (ICDT). Our method takes the degraded underwater image as the conditional input and converts it into latent space where ICDT is applied. ICDT replaces the conventional U-Net backbone in a denoising diffusion probabilistic model (DDPM) with a transformer, and thus inherits favorable properties such as scalability from transformers. Furthermore, we train ICDT with a hybrid loss function involving variances to achieve better log-likelihoods, which meanwhile significantly accelerates the sampling process. We experimentally assess the scalability of ICDTs and compare with prior works in UIE on the Underwater ImageNet dataset.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Image and Signal Denoising Methods · Advanced Image Fusion Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Softmax · Attention Is All You Need · Max Pooling · Concatenated Skip Connection · Convolution · U-Net · Diffusion
