Convolutional Sequence to Sequence Model for Human Dynamics
Chen Li, Zhen Zhang, Wee Sun Lee, Gim Hee Lee

TL;DR
This paper introduces a convolutional neural network-based model for human motion prediction that effectively captures spatial and temporal correlations, outperforming existing methods on standard datasets.
Contribution
The paper proposes a novel CNN-based hierarchical model with long-term and short-term encoders for improved human motion prediction.
Findings
Outperforms state-of-the-art on Human3.6M dataset
Achieves more accurate motion predictions
Captures both invariant and dynamic motion features
Abstract
Human motion modeling is a classic problem in computer vision and graphics. Challenges in modeling human motion include high dimensional prediction as well as extremely complicated dynamics.We present a novel approach to human motion modeling based on convolutional neural networks (CNN). The hierarchical structure of CNN makes it capable of capturing both spatial and temporal correlations effectively. In our proposed approach,a convolutional long-term encoder is used to encode the whole given motion sequence into a long-term hidden variable, which is used with a decoder to predict the remainder of the sequence. The decoder itself also has an encoder-decoder structure, in which the short-term encoder encodes a shorter sequence to a short-term hidden variable, and the spatial decoder maps the long and short-term hidden variable to motion predictions. By using such a model, we are able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Human Motion and Animation
