TL;DR
This paper introduces a learned contingency planner that uses high-dimensional observations and behavioral models to enable safe, adaptive decision-making in multi-agent scenarios like driving, outperforming noncontingent methods.
Contribution
We develop a tractable, end-to-end learned contingency planning method using autoregressive flow models for multi-agent decision-making from observational data.
Findings
Our method significantly outperforms noncontingent planning approaches.
Contingency planning improves safety and decision quality in multi-agent driving scenarios.
The approach is validated on a realistic CARLA driving benchmark.
Abstract
Humans have a remarkable ability to make decisions by accurately reasoning about future events, including the future behaviors and states of mind of other agents. Consider driving a car through a busy intersection: it is necessary to reason about the physics of the vehicle, the intentions of other drivers, and their beliefs about your own intentions. If you signal a turn, another driver might yield to you, or if you enter the passing lane, another driver might decelerate to give you room to merge in front. Competent drivers must plan how they can safely react to a variety of potential future behaviors of other agents before they make their next move. This requires contingency planning: explicitly planning a set of conditional actions that depend on the stochastic outcome of future events. In this work, we develop a general-purpose contingency planner that is learned end-to-end using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
