Multi-Task Learning-Enabled Automatic Vessel Draft Reading for Intelligent Maritime Surveillance
Jingxiang Qu, Ryan Wen Liu, Chenjie Zhao, Yu Guo, Sendren Sheng-Dong, Xu, Fenghua Zhu, and Yisheng Lv

TL;DR
This paper introduces MTL-VDR, a multi-task learning approach that improves vessel draft reading accuracy and robustness for maritime surveillance, achieving real-time performance and outperforming existing methods.
Contribution
The paper presents a novel multi-task learning framework for vessel draft reading that integrates detection, recognition, segmentation, and correction, enhancing robustness and efficiency.
Findings
Achieves over 40 FPS for real-time application
Outperforms state-of-the-art methods in accuracy and robustness
Effectively handles complex conditions like damaged scales
Abstract
The accurate and efficient vessel draft reading (VDR) is an important component of intelligent maritime surveillance, which could be exploited to assist in judging whether the vessel is normally loaded or overloaded. The computer vision technique with an excellent price-to-performance ratio has become a popular medium to estimate vessel draft depth. However, the traditional estimation methods easily suffer from several limitations, such as sensitivity to low-quality images, high computational cost, etc. In this work, we propose a multi-task learning-enabled computational method (termed MTL-VDR) for generating highly reliable VDR. In particular, our MTL-VDR mainly consists of four components, i.e., draft mark detection, draft scale recognition, vessel/water segmentation, and final draft depth estimation. We first construct a benchmark dataset related to draft mark detection and employ a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMaritime Navigation and Safety · Maritime and Coastal Archaeology · Handwritten Text Recognition Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
