UnitModule: A Lightweight Joint Image Enhancement Module for Underwater Object Detection

Zhuoyan Liu; Bo Wang; Ye Li; Jiaxian He; Yunfeng Li

arXiv:2309.04708·cs.CV·July 8, 2025·2 cites

UnitModule: A Lightweight Joint Image Enhancement Module for Underwater Object Detection

Zhuoyan Liu, Bo Wang, Ye Li, Jiaxian He, Yunfeng Li

PDF

Open Access

TL;DR

This paper introduces UnitModule, a lightweight, unsupervised joint image enhancement module designed to improve underwater object detection performance without requiring extra datasets or significant computational overhead.

Contribution

The paper proposes a novel plug-and-play UnitModule with unsupervised training, color cast prediction, and data augmentation techniques to enhance underwater images for detection tasks.

Findings

01

Achieves up to 2.6 AP improvement on YOLOv5-S.

02

Significantly improves detection across various models, especially smaller ones.

03

Maintains high inference speed with only 31K parameters.

Abstract

Underwater object detection faces the problem of underwater image degradation, which affects the performance of the detector. Underwater object detection methods based on noise reduction and image enhancement usually do not provide images preferred by the detector or require additional datasets. In this paper, we propose a plug-and-play \textbf{U}nderwater joi\textbf{n}t \textbf{i}mage enhancemen\textbf{t} \textbf{Module} (UnitModule) that provides the input image preferred by the detector. We design an unsupervised learning loss for the joint training of UnitModule with the detector without additional datasets to improve the interaction between UnitModule and the detector. Furthermore, a color cast predictor with the assisting color cast loss and a data augmentation called Underwater Color Random Transfer (UCRT) are designed to improve the performance of UnitModule on underwater images…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Enhancement Techniques · Underwater Vehicles and Communication Systems · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings