Towards Generating Realistic Underwater Images
Abdul-Kazeem Shamba

TL;DR
This study evaluates various image translation models, including contrastive learning and GANs, for generating realistic underwater images from synthetic data, highlighting the impact of depth information and different training strategies.
Contribution
It systematically compares paired and unpaired image translation methods, demonstrating how depth cues and contrastive learning improve underwater image realism.
Findings
Pix2pix achieves best FID scores among paired models.
CycleGAN provides competitive FID with cycle-consistency loss.
Depth information enhances realism but may reduce structural similarity.
Abstract
This paper explores the use of contrastive learning and generative adversarial networks for generating realistic underwater images from synthetic images with uniform lighting. We investigate the performance of image translation models for generating realistic underwater images using the VAROS dataset. Two key evaluation metrics, Fr\'echet Inception Distance (FID) and Structural Similarity Index Measure (SSIM), provide insights into the trade-offs between perceptual quality and structural preservation. For paired image translation, pix2pix achieves the best FID scores due to its paired supervision and PatchGAN discriminator, while the autoencoder model attains the highest SSIM, suggesting better structural fidelity despite producing blurrier outputs. Among unpaired methods, CycleGAN achieves a competitive FID score by leveraging cycle-consistency loss, whereas CUT, which replaces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Computer Graphics and Visualization Techniques · Advanced Vision and Imaging
MethodsHuMan(Expedia)||How do I get a human at Expedia? · GAN Least Squares Loss · Residual Connection · Dropout · Tanh Activation · Concatenated Skip Connection · Contrastive Learning · Residual Block · Sigmoid Activation · Convolution
