LUCIDGames: Online Unscented Inverse Dynamic Games for Adaptive Trajectory Prediction and Planning
Simon Le Cleac'h, Mac Schwager, Zachary Manchester

TL;DR
LUCIDGames introduces a real-time inverse optimal control algorithm that estimates other agents' objectives online and integrates this into a game-theoretic planner for autonomous robots, enhancing performance in dynamic scenarios.
Contribution
It presents a novel recursive parameter-estimation framework using UKF for real-time inverse optimal control in a receding-horizon planner, without requiring agent communication.
Findings
Achieves 40 Hz real-time performance in complex driving scenarios.
Improves robot planning performance over existing methods.
Effectively estimates other agents' objectives online without explicit communication.
Abstract
Existing game-theoretic planning methods assume that the robot knows the objective functions of the other agents a priori while, in practical scenarios, this is rarely the case. This paper introduces LUCIDGames, an inverse optimal control algorithm that is able to estimate the other agents' objective functions in real time, and incorporate those estimates online into a receding-horizon game-theoretic planner. LUCIDGames solves the inverse optimal control problem by recasting it in a recursive parameter-estimation framework. LUCIDGames uses an unscented Kalman filter (UKF) to iteratively update a Bayesian estimate of the other agents' cost function parameters, improving that estimate online as more data is gathered from the other agents' observed trajectories. The planner then takes account of the uncertainty in the Bayesian parameter estimates of other agents by planning a trajectory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Robotic Path Planning Algorithms · Reinforcement Learning in Robotics
