Value-Decomposed Reinforcement Learning Framework for Taxiway Routing with Hierarchical Conflict-Aware Observations

Shizhong Zhou; Haifeng Liu; Zheng Zhang; Shiyu Zhang; Bo Yang; and Yi Lin

arXiv:2605.08754·cs.AI·May 13, 2026

Value-Decomposed Reinforcement Learning Framework for Taxiway Routing with Hierarchical Conflict-Aware Observations

Shizhong Zhou, Haifeng Liu, Zheng Zhang, Shiyu Zhang, Bo Yang, and Yi Lin

PDF

TL;DR

This paper introduces CaTR, a reinforcement learning framework for real-time taxiway routing that effectively balances safety and efficiency by encoding conflict-aware traffic information.

Contribution

It proposes a hierarchical traffic representation and value-decomposed RL strategy to improve safety and efficiency in airport surface operations.

Findings

01

CaTR outperforms baseline methods in safety-efficiency trade-offs.

02

CaTR maintains practical runtime across different traffic densities.

03

The framework effectively encodes downstream conflict information.

Abstract

Taxiway routing and on-surface conflict avoidance are coupled safety-critical decision problems in airport surface operations. Existing planning and optimization methods are often limited by online computational cost, while reinforcement learning methods may struggle to represent downstream traffic conflicts and balance multiple objectives. This paper presents Conflict-aware Taxiway Routing (CaTR), a reinforcement learning framework for real-time multi-aircraft taxiway routing. CaTR constructs a grid-based airport surface environment with action masking, introduces a hierarchical foresight traffic representation to encode current and downstream conflict-related traffic conditions, and adopts a value-decomposed reinforcement learning strategy to prioritize sparse but safety-critical objectives. Experiments are conducted on a realistic environment based on Changsha Huanghua International…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.