U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training
Zhuoyan Liu, Bo Wang, Bing Wang, Ye Li

TL;DR
U-DECN is a novel end-to-end underwater object detector that effectively handles underwater color cast noise, achieves high accuracy and speed, and is optimized for deployment on embedded underwater devices.
Contribution
The paper introduces U-DECN, a new ConvNet-based query end-to-end underwater object detector with specialized de-noising training and optimization for underwater environments.
Findings
Achieves 64.0 AP on DUO dataset.
Runs at 21 FPS on NVIDIA AGX Orin, 5 times faster than similar models.
Outperforms state-of-the-art query-based detectors in underwater scenarios.
Abstract
Underwater object detection has higher requirements of running speed and deployment efficiency for the detector due to its specific environmental challenges. NMS of two- or one-stage object detectors and transformer architecture of query-based end-to-end object detectors are not conducive to deployment on underwater embedded devices with limited processing power. As for the detrimental effect of underwater color cast noise, recent underwater object detectors make network architecture or training complex, which also hinders their application and deployment on unmanned underwater vehicles. In this paper, we propose the Underwater DECO with improved deNoising training (U-DECN), the query-based end-to-end object detector (with ConvNet encoder-decoder architecture) for underwater color cast noise that addresses the above problems. We integrate advanced technologies from DETR variants into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUnderwater Acoustics Research · Underwater Vehicles and Communication Systems · Advanced Neural Network Applications
MethodsLinear Layer · Deformable Attention Module · Residual Connection · Multi-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Convolution · Softmax
