Trajectory Space Factorization for Deep Video-Based 3D Human Pose Estimation
Jiahao Lin, Gim Hee Lee

TL;DR
This paper introduces a novel deep learning framework for 3D human pose estimation from videos that employs matrix factorization to process all frames simultaneously, improving accuracy and robustness over existing RNN and CNN methods.
Contribution
The proposed approach uses matrix factorization to represent 3D poses across sequences, enabling concurrent processing of all frames and overcoming limitations of prior sequential models.
Findings
Achieves state-of-the-art results on benchmark datasets.
Effectively handles long video sequences.
Outperforms existing RNN and CNN-based methods.
Abstract
Existing deep learning approaches on 3d human pose estimation for videos are either based on Recurrent or Convolutional Neural Networks (RNNs or CNNs). However, RNN-based frameworks can only tackle sequences with limited frames because sequential models are sensitive to bad frames and tend to drift over long sequences. Although existing CNN-based temporal frameworks attempt to address the sensitivity and drift problems by concurrently processing all input frames in the sequence, the existing state-of-the-art CNN-based framework is limited to 3d pose estimation of a single frame from a sequential input. In this paper, we propose a deep learning-based framework that utilizes matrix factorization for sequential 3d human poses estimation. Our approach processes all input frames concurrently to avoid the sensitivity and drift problems, and yet outputs the 3d pose estimates for every frame in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Video Surveillance and Tracking Methods
MethodsDiscrete Cosine Transform
