# GAN-based underwater image enhancement and scene classification using transfer learning

**Authors:** Amani Homoud, Saptarshi Das

PMC · DOI: 10.1371/journal.pone.0345593 · PLOS One · 2026-03-27

## TL;DR

This paper explores using GANs and deep learning to enhance underwater images and classify marine species, improving understanding of underwater ecosystems.

## Contribution

The paper introduces a novel pipeline combining GAN-based enhancement and transfer learning for underwater image analysis.

## Key findings

- The Gray World algorithm effectively reduces color distortion in underwater images.
- ESRGAN outperforms traditional methods in enhancing noisy underwater images.
- Transfer learning with VGG16, ResNet50, and DenseNet121 improves classification accuracy of marine species.

## Abstract

This paper provides an exploratory analysis of underwater video analysis techniques to enhance image quality and facilitate accurate classification of different marine species. Our methodology progresses through several steps, beginning with the quality of underwater images that might be reduced by variables such as decreased light intensity, color modification, and limited visibility. These attributes pose significant challenges to develop accurate object detection methods. This paper outlines the processing pipeline employed to enhance the quality of images from underwater videos and facilitate precise object detection. First, we use the Gray World (GW) algorithm for image enhancement, effectively mitigating the challenges of aquatic environment, such as color distortion and low contrast. Subsequently, we compare the traditional Histogram Equalization (HE) and the Contrast Limited Adaptive Histogram Equalization (CLAHE) algorithms to assess their efficacy in enhancing underwater image quality. Next, Canny Edge Detection is utilized to identify the prominent features in the enhanced images, aiding in subsequent classification tasks. Next, three state-of-the-art deep learning models, Visual Geometry Group 16-layer network (VGG16), 50-layer Residual Network (ResNet50), and 121-layer Densely Connected Convolutional Network (DenseNet121), are leveraged through transfer learning to classify underwater species, including fish, coral reefs, and sea turtles. Finally, by enhancing the visual quality of underwater images, our research contributes to better understanding of the underwater ecosystem and supports conservation efforts. Enhanced Super-Resolution GAN (ESRGAN) is a superior Generative Adversarial Network (GAN) technique to improve the quality of noisy images. This paper contributes to advancing the field of underwater image and video analysis, offering valuable insights for applications in marine biology, environmental monitoring, underwater robotics, and autonomous navigation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13029775/full.md

## Figures

23 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13029775/full.md

## References

56 references — full list in the complete paper: https://tomesphere.com/paper/PMC13029775/full.md

---
Source: https://tomesphere.com/paper/PMC13029775