Transformer-based Multi-agent Reinforcement Learning for Separation Assurance in Structured and Unstructured Airspaces
Arsyi Aziz, Peng Wei

TL;DR
This paper introduces a transformer-based multi-agent reinforcement learning approach for aircraft separation assurance that generalizes well across different airspace structures, ensuring safety and efficiency.
Contribution
It recasts the MARL problem in a relative polar state space and trains a transformer encoder, improving adaptability and scalability for diverse airspace configurations.
Findings
Single encoder configuration outperforms deeper variants.
Near-zero mid-air collision rates achieved.
Outperforms baseline attention-only model.
Abstract
Conventional optimization-based metering depends on strict adherence to precomputed schedules, which limits the flexibility required for the stochastic operations of Advanced Air Mobility (AAM). In contrast, multi-agent reinforcement learning (MARL) offers a decentralized, adaptive framework that can better handle uncertainty, required for safe aircraft separation assurance. Despite this advantage, current MARL approaches often overfit to specific airspace structures, limiting their adaptability to new configurations. To improve generalization, we recast the MARL problem in a relative polar state space and train a transformer encoder model across diverse traffic patterns and intersection angles. The learned model provides speed advisories to resolve conflicts while maintaining aircraft near their desired cruising speeds. In our experiments, we evaluated encoder depths of 1, 2, and 3…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Traffic Management and Optimization · Aerospace and Aviation Technology · Adversarial Robustness in Machine Learning
