TL;DR
This paper introduces a probabilistic planning method for human-robot interaction that models multiple possible human responses using deep generative models, enabling real-time decision making in complex, uncertain scenarios like traffic weaving.
Contribution
The paper develops a multimodal probabilistic framework using CVAEs for modeling human responses, allowing robots to plan effectively in highly uncertain, dynamic interactions.
Findings
Successfully models multimodal human responses in traffic scenarios
Enables real-time policy construction through parallel sampling
Demonstrates improved interaction handling in simulations
Abstract
This paper presents a method for constructing human-robot interaction policies in settings where multimodality, i.e., the possibility of multiple highly distinct futures, plays a critical role in decision making. We are motivated in this work by the example of traffic weaving, e.g., at highway on-ramps/off-ramps, where entering and exiting cars must swap lanes in a short distance---a challenging negotiation even for experienced drivers due to the inherent multimodal uncertainty of who will pass whom. Our approach is to learn multimodal probability distributions over future human actions from a dataset of human-human exemplars and perform real-time robot policy construction in the resulting environment model through massively parallel sampling of human responses to candidate robot action sequences. Direct learning of these distributions is made possible by recent advances in the theory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
