Spatio-Temporal Encoding and Decoding-Based Method for Future Human   Activity Skeleton Synthesis

Tingyu Liu; Jun Huang; Chenyi Weng

arXiv:2407.05573·cs.CV·July 9, 2024

Spatio-Temporal Encoding and Decoding-Based Method for Future Human Activity Skeleton Synthesis

Tingyu Liu, Jun Huang, Chenyi Weng

PDF

Open Access

TL;DR

This paper introduces a spatio-temporal encoding-decoding approach for synthesizing future human activity skeletons, emphasizing efficiency and accuracy over traditional GAN-based methods.

Contribution

It presents a novel encoding-decoding framework that reduces computational costs and improves prediction accuracy for future skeleton sequences.

Findings

01

Outperforms existing algorithms in skeleton prediction accuracy

02

Generates sequences with smaller errors and fewer parameters

03

Enhances early activity prediction capabilities

Abstract

Inferring future activity information based on observed activity data is a crucial step to improve the accuracy of early activity prediction. Traditional methods based on generative adversarial networks(GAN) or joint learning frameworks can achieve good prediction accuracy under low observation ratios, but they usually have high computational costs. In view of this, this paper proposes a spatio-temporal encoding and decoding-based method for future human activity skeleton synthesis. Firstly, algorithms such as time control, discrete cosine transform, and low-pass filtering are used to cut or pad the skeleton sequences. Secondly, the encoder and decoder are responsible for extracting intermediate semantic encoding from observed skeleton sequences and inferring future sequences from the intermediate semantic encoding, respectively. Finally, joint displacement error, velocity error, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsContext-Aware Activity Recognition Systems