TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild
Vida Adeli, Mahsa Ehsanpour, Ian Reid, Juan Carlos Niebles, Silvio, Savarese, Ehsan Adeli, Hamid Rezatofighi

TL;DR
TRiPOD is a novel graph attentional network-based method for joint human trajectory and pose forecasting that effectively models interactions and occlusions, outperforming previous approaches on new challenging benchmarks.
Contribution
The paper introduces TRiPOD, a new approach combining graph attention and message passing for joint trajectory and pose prediction, including occlusion handling and a new benchmark dataset.
Findings
TRiPOD outperforms prior methods on benchmark datasets.
Incorporates occlusion awareness in pose forecasting.
Provides new evaluation metrics for joint prediction accuracy.
Abstract
Joint forecasting of human trajectory and pose dynamics is a fundamental building block of various applications ranging from robotics and autonomous driving to surveillance systems. Predicting body dynamics requires capturing subtle information embedded in the humans' interactions with each other and with the objects present in the scene. In this paper, we propose a novel TRajectory and POse Dynamics (nicknamed TRiPOD) method based on graph attentional networks to model the human-human and human-object interactions both in the input space and the output space (decoded future output). The model is supplemented by a message passing interface over the graphs to fuse these different levels of interactions efficiently. Furthermore, to incorporate a real-world challenge, we propound to learn an indicator representing whether an estimated body joint is visible/invisible at each frame, e.g. due…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
