LLM-VLM Fusion Framework for Autonomous Maritime Port Inspection using a Heterogeneous UAV-USV System
Muhayy Ud Din, Waseem Akram, Ahsan B. Bakht, Irfan Hussain

TL;DR
This paper presents an autonomous maritime port inspection framework that combines Large Language Models (LLMs) and Vision Language Models (VLMs) with a cooperative UAV-USV system for scalable, context-aware inspection and compliance assessment.
Contribution
It introduces an integrated LLM-VLM framework that replaces traditional state-machine mission planners with LLM-driven symbolic planning and adds VLM-based semantic inspection for maritime port monitoring.
Findings
Validated in a maritime simulator with realistic port scenarios
Demonstrated in real-world robotic inspection trials
Achieved resource-efficient autonomous inspection capabilities
Abstract
Maritime port inspection plays a critical role in ensuring safety, regulatory compliance, and operational efficiency in complex maritime environments. However, existing inspection methods often rely on manual operations and conventional computer vision techniques that lack scalability and contextual understanding. This study introduces a novel integrated engineering framework that utilizes the synergy between Large Language Models (LLMs) and Vision Language Models (VLMs) to enable autonomous maritime port inspection using cooperative aerial and surface robotic platforms. The proposed framework replaces traditional state-machine mission planners with LLM-driven symbolic planning and improves perception pipelines through VLM-based semantic inspection, enabling context-aware and adaptive monitoring. The LLM module translates natural language mission instructions into executable symbolic…
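To make the idea of LLM-driven symbolic planning more concrete, the following is a minimal sketch of how a symbolic plan for a UAV-USV pair might be checked before execution. It is an illustration only, not the authors' implementation: the action names, predicates, and the `validate_plan` helper are all hypothetical, and the plan list stands in for whatever an LLM planner would emit.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Action:
    """A symbolic action with STRIPS-style preconditions and effects."""
    name: str
    preconditions: frozenset
    add_effects: frozenset
    del_effects: frozenset = frozenset()


# Hypothetical action schema (assumption: none of these names come from the paper).
ACTIONS = {
    "usv_goto_dock": Action(
        "usv_goto_dock", frozenset(), frozenset({"usv_at_dock"})),
    "uav_takeoff": Action(
        "uav_takeoff", frozenset({"uav_on_usv"}),
        frozenset({"uav_airborne"}), frozenset({"uav_on_usv"})),
    "uav_inspect_crane": Action(
        "uav_inspect_crane", frozenset({"uav_airborne", "usv_at_dock"}),
        frozenset({"crane_inspected"})),
    "uav_land": Action(
        "uav_land", frozenset({"uav_airborne"}),
        frozenset({"uav_on_usv"}), frozenset({"uav_airborne"})),
}


def validate_plan(plan, initial_state):
    """Simulate a plan step by step; return (ok, message, final_state)."""
    state = set(initial_state)
    for step, name in enumerate(plan):
        action = ACTIONS.get(name)
        if action is None:
            return False, f"step {step}: unknown action '{name}'", state
        missing = action.preconditions - state
        if missing:
            return False, f"step {step}: '{name}' missing {sorted(missing)}", state
        # Apply delete effects first, then add effects.
        state -= action.del_effects
        state |= action.add_effects
    return True, "ok", state


# A plan as an LLM planner might emit it (hypothetical output).
llm_plan = ["usv_goto_dock", "uav_takeoff", "uav_inspect_crane", "uav_land"]
ok, message, final_state = validate_plan(llm_plan, {"uav_on_usv"})
print(ok, message, sorted(final_state))
```

Checking a generated plan against explicit preconditions in this way is one plausible safeguard when a language model replaces a hand-written state machine, since an ill-ordered step (for example, inspecting before takeoff) is rejected before any robot moves.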
Taxonomy
Topics: Maritime Navigation and Safety · Underwater Vehicles and Communication Systems · Oil Spill Detection and Mitigation
