Multi-Task Learning-Enabled Automatic Vessel Draft Reading for   Intelligent Maritime Surveillance

Jingxiang Qu; Ryan Wen Liu; Chenjie Zhao; Yu Guo; Sendren Sheng-Dong; Xu; Fenghua Zhu; and Yisheng Lv

arXiv:2310.07212·cs.CV·October 12, 2023

Multi-Task Learning-Enabled Automatic Vessel Draft Reading for Intelligent Maritime Surveillance

Jingxiang Qu, Ryan Wen Liu, Chenjie Zhao, Yu Guo, Sendren Sheng-Dong, Xu, Fenghua Zhu, and Yisheng Lv

PDF

Open Access

TL;DR

This paper introduces MTL-VDR, a multi-task learning approach that improves vessel draft reading accuracy and robustness for maritime surveillance, achieving real-time performance and outperforming existing methods.

Contribution

The paper presents a novel multi-task learning framework for vessel draft reading that integrates detection, recognition, segmentation, and correction, enhancing robustness and efficiency.

Findings

01

Achieves over 40 FPS for real-time application

02

Outperforms state-of-the-art methods in accuracy and robustness

03

Effectively handles complex conditions like damaged scales

Abstract

The accurate and efficient vessel draft reading (VDR) is an important component of intelligent maritime surveillance, which could be exploited to assist in judging whether the vessel is normally loaded or overloaded. The computer vision technique with an excellent price-to-performance ratio has become a popular medium to estimate vessel draft depth. However, the traditional estimation methods easily suffer from several limitations, such as sensitivity to low-quality images, high computational cost, etc. In this work, we propose a multi-task learning-enabled computational method (termed MTL-VDR) for generating highly reliable VDR. In particular, our MTL-VDR mainly consists of four components, i.e., draft mark detection, draft scale recognition, vessel/water segmentation, and final draft depth estimation. We first construct a benchmark dataset related to draft mark detection and employ a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMaritime Navigation and Safety · Maritime and Coastal Archaeology · Handwritten Text Recognition Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings