Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision
Karttikeya Mangalam, Ehsan Adeli, Kuan-Hui Lee, Adrien Gaidon, Juan, Carlos Niebles

TL;DR
This paper introduces a unified framework for pedestrian locomotion forecasting that leverages noisy supervision and disentangles motion into subcomponents, achieving state-of-the-art results in egocentric settings.
Contribution
It proposes a novel hierarchical trajectory forecasting model that learns from noisy annotations and disentangles human motion into global and local components.
Findings
Achieves state-of-the-art accuracy in egocentric human locomotion prediction.
Effectively learns from noisy, automatically generated annotations.
Disentangles motion into trajectory and pose components for improved modeling.
Abstract
We tackle the problem of Human Locomotion Forecasting, a task for jointly predicting the spatial positions of several keypoints on the human body in the near future under an egocentric setting. In contrast to the previous work that aims to solve either the task of pose prediction or trajectory forecasting in isolation, we propose a framework to unify the two problems and address the practically useful task of pedestrian locomotion prediction in the wild. Among the major challenges in solving this task is the scarcity of annotated egocentric video datasets with dense annotations for pose, depth, or egomotion. To surmount this difficulty, we use state-of-the-art models to generate (noisy) annotations and propose robust forecasting models that can learn from this noisy supervision. We present a method to disentangle the overall pedestrian motion into easier to learn subparts by utilizing a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
