Learning Two-agent Motion Planning Strategies from Generalized Nash Equilibrium for Model Predictive Control
Hansung Kim, Edward L. Zhu, Chang Seok Lim, Francesco Borrelli

TL;DR
This paper presents a decentralized motion planning algorithm for two agents that leverages learned game-theoretic interactions within an MPC framework, enabling competitive and cooperative behaviors.
Contribution
It introduces IGT-MPC, combining neural network predictions of game outcomes with MPC for improved multi-agent motion planning.
Findings
Successfully demonstrated competitive and cooperative behaviors in simulated scenarios
Achieved real-time planning with learned value functions guiding agent interactions
Validated the approach on head-to-head racing and intersection navigation tasks
Abstract
We introduce an Implicit Game-Theoretic MPC (IGT-MPC), a decentralized algorithm for two-agent motion planning that uses a learned value function that predicts the game-theoretic interaction outcomes as the terminal cost-to-go function in a model predictive control (MPC) framework, guiding agents to implicitly account for interactions with other agents and maximize their reward. This approach applies to competitive and cooperative multi-agent motion planning problems which we formulate as constrained dynamic games. Given a constrained dynamic game, we randomly sample initial conditions and solve for the generalized Nash equilibrium (GNE) to generate a dataset of GNE solutions, computing the reward outcome of each game-theoretic interaction from the GNE. The data is used to train a simple neural network to predict the reward outcome, which we use as the terminal cost-to-go function in an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics
