ShipTraj-R1: Reinforcing Ship Trajectory Prediction in Large Language Models via Group Relative Policy Optimization
Yang Zhan, Yunhao Li, Zhang Chao, Yuxu Lu, Yan Li

TL;DR
ShipTraj-R1 leverages large language models with a novel reinforcement learning framework to improve ship trajectory prediction accuracy by reformulating it as a text generation task, guided by dynamic prompts and rule-based rewards.
Contribution
The paper introduces ShipTraj-R1, a new LLM-based approach that applies group relative policy optimization to enhance maritime trajectory prediction performance.
Findings
Achieves the lowest prediction error on real-world datasets.
Outperforms existing deep learning and LLM baselines.
Demonstrates effective reasoning and adaptive prediction in maritime scenarios.
Abstract
Recent advancements in reinforcement fine-tuning have significantly improved the reasoning ability of large language models (LLMs). In particular, methods such as group relative policy optimization (GRPO) have demonstrated strong capabilities across various fields. However, applying LLMs to ship trajectory prediction remains largely unexplored. In this paper, we propose ShipTraj-R1, a novel LLM-based framework that reformulates ship trajectory prediction as a text-to-text generation problem. (1) We design a dynamic prompt containing trajectory information about conflicting ships to guide the model to achieve adaptive chain-of-thought (CoT) reasoning. (2) We introduce a comprehensive rule-based reward mechanism to incentivize the reasoning format and prediction accuracy of the model. (3) Our ShipTraj-R1 is reinforced through the GRPO mechanism guided by domain-specific prompts and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMaritime Navigation and Safety · Maritime Transport Emissions and Efficiency · Multimodal Machine Learning Applications
