Real-time Ship Recognition and Georeferencing for the Improvement of Maritime Situational Awareness
Borja Carrillo Perez

TL;DR
This paper introduces a deep learning-based system for real-time ship recognition and georeferencing, utilizing a novel dataset and architecture to enhance maritime situational awareness and operational decision-making.
Contribution
It presents a new dataset, ShipSG, and a custom segmentation architecture, ScatYOLOv8+CBAM, optimized for embedded systems, advancing real-time ship detection and georeferencing methods.
Findings
Achieved 75.46% mAP with 25.3 ms per frame on embedded hardware.
Enhanced small and distant ship recognition by 8-11% mAP using slicing mechanism.
Georeferencing errors of 18 m at 400 m distance, 44 m at 1200 m distance.
Abstract
In an era where maritime infrastructures are crucial, advanced situational awareness solutions are increasingly important. The use of optical camera systems can allow real-time usage of maritime footage. This thesis presents an investigation into leveraging deep learning and computer vision to advance real-time ship recognition and georeferencing for the improvement of maritime situational awareness. A novel dataset, ShipSG, is introduced, containing 3,505 images and 11,625 ship masks with corresponding class and geographic position. After an exploration of state-of-the-art, a custom real-time segmentation architecture, ScatYOLOv8+CBAM, is designed for the NVIDIA Jetson AGX Xavier embedded system. This architecture adds the 2D scattering transform and attention mechanisms to YOLOv8, achieving an mAP of 75.46% and an 25.3 ms per frame, outperforming state-of-the-art methods by over 5%.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need · You Only Look Once
