Transformer-based Multi-agent Reinforcement Learning for Separation Assurance in Structured and Unstructured Airspaces

Arsyi Aziz; Peng Wei

arXiv:2601.04401·cs.RO·January 9, 2026

Transformer-based Multi-agent Reinforcement Learning for Separation Assurance in Structured and Unstructured Airspaces

Arsyi Aziz, Peng Wei

PDF

Open Access

TL;DR

This paper introduces a transformer-based multi-agent reinforcement learning approach for aircraft separation assurance that generalizes well across different airspace structures, ensuring safety and efficiency.

Contribution

It recasts the MARL problem in a relative polar state space and trains a transformer encoder, improving adaptability and scalability for diverse airspace configurations.

Findings

01

Single encoder configuration outperforms deeper variants.

02

Near-zero mid-air collision rates achieved.

03

Outperforms baseline attention-only model.

Abstract

Conventional optimization-based metering depends on strict adherence to precomputed schedules, which limits the flexibility required for the stochastic operations of Advanced Air Mobility (AAM). In contrast, multi-agent reinforcement learning (MARL) offers a decentralized, adaptive framework that can better handle uncertainty, required for safe aircraft separation assurance. Despite this advantage, current MARL approaches often overfit to specific airspace structures, limiting their adaptability to new configurations. To improve generalization, we recast the MARL problem in a relative polar state space and train a transformer encoder model across diverse traffic patterns and intersection angles. The learned model provides speed advisories to resolve conflicts while maintaining aircraft near their desired cruising speeds. In our experiments, we evaluated encoder depths of 1, 2, and 3…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAir Traffic Management and Optimization · Aerospace and Aviation Technology · Adversarial Robustness in Machine Learning