$\alpha$-Potential Games for Decentralized Control of Connected and Automated Vehicles
Xuan Di, Anran Hu, Zhexin Wang, Yufei Zhang

TL;DR
This paper introduces an $ extalpha$-potential game framework for decentralized control of connected and automated vehicles, enabling scalable, safe, and heterogeneous traffic management with collision avoidance.
Contribution
It proposes a novel $ extalpha$-potential game approach that simplifies computing Nash equilibria in large-scale, heterogeneous vehicle populations, overcoming limitations of existing mean-field methods.
Findings
The $ extalpha$-NE can be computed via decentralized control problems.
The framework effectively models collision and obstacle avoidance.
Scalable neural-network algorithms successfully find equilibria.
Abstract
Designing scalable and safe control strategies for large populations of connected and automated vehicles (CAVs) requires accounting for strategic interactions among heterogeneous agents under decentralized information. While dynamic games provide a natural modeling framework, computing Nash equilibria (NEs) in large-scale settings remains challenging, and existing mean-field game approximations rely on restrictive assumptions that fail to capture collision avoidance and heterogeneous behaviors. This paper proposes an -potential game framework for decentralized CAV control. We show that computing -NE reduces to solving a decentralized control problem, and derive tight bounds of the parameter based on interaction intensity and asymmetry. We further develop scalable policy gradient algorithms for computing -NEs using decentralized neural-network policies.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Adaptive Dynamic Programming Control · Reinforcement Learning in Robotics
