Spatio-temporal Tendency Reasoning for Human Body Pose and Shape Estimation from Videos
Boyang Zhang, SuPing Wu, Hu Cao, Kehua Ma, Pan Li, Lei Lin

TL;DR
This paper introduces a spatio-temporal tendency reasoning network that improves human pose and shape estimation from videos by effectively modeling temporal and spatial features, leading to more accurate and natural motion sequences.
Contribution
The paper proposes novel TTR and STE modules for hierarchical temporal and spatial feature reasoning, enhancing spatio-temporal feature extraction in human pose estimation.
Findings
Achieves competitive results on three large-scale datasets.
Effectively models temporal tendencies with hierarchical residual connections.
Enhances spatial features using frequency domain sensitivity.
Abstract
In this paper, we present a spatio-temporal tendency reasoning (STR) network for recovering human body pose and shape from videos. Previous approaches have focused on how to extend 3D human datasets and temporal-based learning to promote accuracy and temporal smoothing. Different from them, our STR aims to learn accurate and natural motion sequences in an unconstrained environment through temporal and spatial tendency and to fully excavate the spatio-temporal features of existing video data. To this end, our STR learns the representation of features in the temporal and spatial dimensions respectively, to concentrate on a more robust representation of spatio-temporal features. More specifically, for efficient temporal modeling, we first propose a temporal tendency reasoning (TTR) module. TTR constructs a time-dimensional hierarchical residual connection representation within a video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Advanced Vision and Imaging
MethodsResidual Connection
