Camera Motion Agnostic 3D Human Pose Estimation
Seong Hyun Kim, Sunwon Jeong, Sungbum Park, Ju Yong Chang

TL;DR
This paper introduces a novel method for 3D human pose estimation that is invariant to camera motion, enabling accurate pose and motion analysis in videos captured with moving cameras by predicting global motion sequences.
Contribution
The paper proposes a camera motion agnostic approach using a bidirectional GRU network to estimate global motion differences, improving 3D pose estimation in moving-camera scenarios.
Findings
Effective in moving-camera environments
Outperforms existing methods on 3DPW and synthetic datasets
Provides publicly available code and datasets
Abstract
Although the performance of 3D human pose and shape estimation methods has improved significantly in recent years, existing approaches typically generate 3D poses defined in camera or human-centered coordinate system. This makes it difficult to estimate a person's pure pose and motion in world coordinate system for a video captured using a moving camera. To address this issue, this paper presents a camera motion agnostic approach for predicting 3D human pose and mesh defined in the world coordinate system. The core idea of the proposed approach is to estimate the difference between two adjacent global poses (i.e., global motion) that is invariant to selecting the coordinate system, instead of the global pose coupled to the camera motion. To this end, we propose a network based on bidirectional gated recurrent units (GRUs) that predicts the global motion sequence from the local pose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Gait Recognition and Analysis
