UnitModule: A Lightweight Joint Image Enhancement Module for Underwater Object Detection
Zhuoyan Liu, Bo Wang, Ye Li, Jiaxian He, Yunfeng Li

TL;DR
This paper introduces UnitModule, a lightweight, unsupervised joint image enhancement module designed to improve underwater object detection performance without requiring extra datasets or significant computational overhead.
Contribution
The paper proposes a novel plug-and-play UnitModule with unsupervised training, color cast prediction, and data augmentation techniques to enhance underwater images for detection tasks.
Findings
Achieves up to 2.6 AP improvement on YOLOv5-S.
Significantly improves detection across various models, especially smaller ones.
Maintains high inference speed with only 31K parameters.
Abstract
Underwater object detection faces the problem of underwater image degradation, which affects the performance of the detector. Underwater object detection methods based on noise reduction and image enhancement usually do not provide images preferred by the detector or require additional datasets. In this paper, we propose a plug-and-play \textbf{U}nderwater joi\textbf{n}t \textbf{i}mage enhancemen\textbf{t} \textbf{Module} (UnitModule) that provides the input image preferred by the detector. We design an unsupervised learning loss for the joint training of UnitModule with the detector without additional datasets to improve the interaction between UnitModule and the detector. Furthermore, a color cast predictor with the assisting color cast loss and a data augmentation called Underwater Color Random Transfer (UCRT) are designed to improve the performance of UnitModule on underwater images…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Underwater Vehicles and Communication Systems · Advanced Neural Network Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
