Long-Term Human Video Generation of Multiple Futures Using Poses
Naoya Fushishita, Antonio Tejero-de-Pablos, Yusuke Mukuta, Tatsuya, Harada

TL;DR
This paper introduces a novel method for long-term, multi-future human pose prediction from videos, utilizing adversarial learning with additional inputs to enhance diversity and realism, and generating future videos for practical applications.
Contribution
It proposes a new approach combining adversarial learning with latent codes and attraction points to predict diverse, long-term human poses and generate corresponding videos.
Findings
Outperforms state-of-the-art methods in realism, diversity, and accuracy.
Successfully predicts multiple long-term futures from a single input video.
Generates realistic future videos based on predicted human poses.
Abstract
Predicting future human behavior from an input human video is a useful task for applications such as autonomous driving and robotics. While most previous works predict a single future, multiple futures with different behavior can potentially occur. Moreover, if the predicted future is too short (e.g., less than one second), it may not be fully usable by a human or other systems. In this paper, we propose a novel method for future human pose prediction capable of predicting multiple long-term futures. This makes the predictions more suitable for real applications. Also, from the input video and the predicted human behavior, we generate future videos. First, from an input human video, we generate sequences of future human poses (i.e., the image coordinates of their body-joints) via adversarial learning. Adversarial learning suffers from mode collapse, which makes it difficult to generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods
