Adaptive Enhancement and Dual-Pooling Sequential Attention for Lightweight Underwater Object Detection with YOLOv10
Md. Mushibur Rahman, Umme Fawzia Rahim, Enam Ahmed Taufik

TL;DR
This paper presents a lightweight, robust underwater object detection framework based on YOLOv10, incorporating adaptive enhancement, dual-pooling attention, and a novel loss to improve accuracy and efficiency in challenging underwater environments.
Contribution
It introduces a novel framework combining adaptive enhancement, dual-pooling sequential attention, and a specialized loss for improved underwater object detection with YOLOv10.
Findings
Achieved 88.9% mAP on RUOD dataset
Enhanced detection accuracy by over 6% compared to baseline
Maintained a compact model with only 2.8 million parameters
Abstract
Underwater object detection constitutes a pivotal endeavor within the realms of marine surveillance and autonomous underwater systems; however, it presents significant challenges due to pronounced visual impairments arising from phenomena such as light absorption, scattering, and diminished contrast. In response to these formidable challenges, this manuscript introduces a streamlined yet robust framework for underwater object detection, grounded in the YOLOv10 architecture. The proposed method integrates a Multi-Stage Adaptive Enhancement module to improve image quality, a Dual-Pooling Sequential Attention (DPSA) mechanism embedded into the backbone to strengthen multi-scale feature representation, and a Focal Generalized IoU Objectness (FGIoU) loss to jointly improve localization accuracy and objectness prediction under class imbalance. Comprehensive experimental evaluations conducted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
