TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild

Vida Adeli; Mahsa Ehsanpour; Ian Reid; Juan Carlos Niebles; Silvio; Savarese; Ehsan Adeli; Hamid Rezatofighi

arXiv:2104.04029·cs.CV·July 7, 2022

TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild

Vida Adeli, Mahsa Ehsanpour, Ian Reid, Juan Carlos Niebles, Silvio, Savarese, Ehsan Adeli, Hamid Rezatofighi

PDF

TL;DR

TRiPOD is a novel graph attentional network-based method for joint human trajectory and pose forecasting that effectively models interactions and occlusions, outperforming previous approaches on new challenging benchmarks.

Contribution

The paper introduces TRiPOD, a new approach combining graph attention and message passing for joint trajectory and pose prediction, including occlusion handling and a new benchmark dataset.

Findings

01

TRiPOD outperforms prior methods on benchmark datasets.

02

Incorporates occlusion awareness in pose forecasting.

03

Provides new evaluation metrics for joint prediction accuracy.

Abstract

Joint forecasting of human trajectory and pose dynamics is a fundamental building block of various applications ranging from robotics and autonomous driving to surveillance systems. Predicting body dynamics requires capturing subtle information embedded in the humans' interactions with each other and with the objects present in the scene. In this paper, we propose a novel TRajectory and POse Dynamics (nicknamed TRiPOD) method based on graph attentional networks to model the human-human and human-object interactions both in the input space and the output space (decoded future output). The model is supplemented by a message passing interface over the graphs to fuse these different levels of interactions efficiently. Furthermore, to incorporate a real-world challenge, we propound to learn an indicator representing whether an estimated body joint is visible/invisible at each frame, e.g. due…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.