TL;DR
This paper introduces a novel generative modeling approach for predicting multimodal, multi-human behaviors in complex, interactive scenarios using graphical models and deep learning, demonstrated on basketball trajectories.
Contribution
It develops a new method combining graphical models with conditional variational autoencoders to model and predict diverse human behaviors in multi-agent settings.
Findings
Effective modeling of multimodal human actions in complex scenarios
Improved prediction accuracy in multi-human trajectory forecasting
Demonstrated applicability to robotic and autonomous systems
Abstract
This work presents a methodology for modeling and predicting human behavior in settings with N humans interacting in highly multimodal scenarios (i.e. where there are many possible highly-distinct futures). A motivating example includes robots interacting with humans in crowded environments, such as self-driving cars operating alongside human-driven vehicles or human-robot collaborative bin packing in a warehouse. Our approach to model human behavior in such uncertain environments is to model humans in the scene as nodes in a graphical model, with edges encoding relationships between them. For each human, we learn a multimodal probability distribution over future actions from a dataset of multi-human interactions. Learning such distributions is made possible by recent advances in the theory of conditional variational autoencoders and deep learning approximations of probabilistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
