Tree-structured Policy Planning with Learned Behavior Models
Yuxiao Chen, Peter Karkus, Boris Ivanovic, Xinshuo Weng, and Marco, Pavone

TL;DR
This paper introduces Tree Policy Planning (TPP), a scalable, interpretable policy planning method for autonomous vehicles that accounts for multimodal agent behaviors and improves over existing approaches.
Contribution
We propose TPP, a novel tree-structured policy planner compatible with deep learning models, enabling multistage motion planning and interaction-aware decision making for AVs.
Findings
TPP scales effectively to realistic AV scenarios.
TPP significantly outperforms non-policy baselines.
Demonstrated success in real-world nuScenes dataset simulations.
Abstract
Autonomous vehicles (AVs) need to reason about the multimodal behavior of neighboring agents while planning their own motion. Many existing trajectory planners seek a single trajectory that performs well under \emph{all} plausible futures simultaneously, ignoring bi-directional interactions and thus leading to overly conservative plans. Policy planning, whereby the ego agent plans a policy that reacts to the environment's multimodal behavior, is a promising direction as it can account for the action-reaction interactions between the AV and the environment. However, most existing policy planners do not scale to the complexity of real autonomous vehicle applications: they are either not compatible with modern deep learning prediction models, not interpretable, or not able to generate high quality trajectories. To fill this gap, we propose Tree Policy Planning (TPP), a policy planner that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Transportation and Mobility Innovations
