Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems
Iman Sharifi, Alex Zongo, Peng Wei

TL;DR
This paper explores fine-tuning large language models for tactical deconfliction of small unmanned aerial systems, improving safety and decision accuracy in complex airspace scenarios.
Contribution
It introduces a simulation-based data generation pipeline and applies parameter-efficient fine-tuning strategies to enhance LLM decision-making in air traffic control tasks.
Findings
Supervised LoRA fine-tuning improves decision accuracy and safety.
Fine-tuned LLMs reduce near mid-air collisions.
GRPO enhances coordination but is less robust with heterogeneous policies.
Abstract
The growing deployment of small Unmanned Aerial Systems (sUASs) in low-altitude airspaces has increased the need for reliable tactical deconfliction under safety-critical constraints. Tactical deconfliction involves short-horizon decision-making in dense, partially observable, and heterogeneous multi-agent environments, where both cooperative separation assurance and operational efficiency must be maintained. While Large Language Models (LLMs) exhibit strong reasoning capabilities, their direct application to air traffic control remains limited by insufficient domain grounding and unpredictable output inconsistency. This paper investigates LLMs as decision-makers in cooperative multi-agent tactical deconfliction using fine-tuning strategies that align model outputs to human operator heuristics. We propose a simulation-to-language data generation pipeline based on the BlueSky air traffic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
