Long-Term Human Video Generation of Multiple Futures Using Poses

Naoya Fushishita; Antonio Tejero-de-Pablos; Yusuke Mukuta; Tatsuya; Harada

arXiv:1904.07538·cs.CV·June 2, 2021·1 cites

Long-Term Human Video Generation of Multiple Futures Using Poses

Naoya Fushishita, Antonio Tejero-de-Pablos, Yusuke Mukuta, Tatsuya, Harada

PDF

Open Access

TL;DR

This paper introduces a novel method for long-term, multi-future human pose prediction from videos, utilizing adversarial learning with additional inputs to enhance diversity and realism, and generating future videos for practical applications.

Contribution

It proposes a new approach combining adversarial learning with latent codes and attraction points to predict diverse, long-term human poses and generate corresponding videos.

Findings

01

Outperforms state-of-the-art methods in realism, diversity, and accuracy.

02

Successfully predicts multiple long-term futures from a single input video.

03

Generates realistic future videos based on predicted human poses.

Abstract

Predicting future human behavior from an input human video is a useful task for applications such as autonomous driving and robotics. While most previous works predict a single future, multiple futures with different behavior can potentially occur. Moreover, if the predicted future is too short (e.g., less than one second), it may not be fully usable by a human or other systems. In this paper, we propose a novel method for future human pose prediction capable of predicting multiple long-term futures. This makes the predictions more suitable for real applications. Also, from the input video and the predicted human behavior, we generate future videos. First, from an input human video, we generate sequences of future human poses (i.e., the image coordinates of their body-joints) via adversarial learning. Adversarial learning suffers from mode collapse, which makes it difficult to generate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods