Unsupervised Multi-Person 3D Human Pose Estimation From 2D Poses Alone
Peter Hardy, Hansung Kim

TL;DR
This paper introduces an unsupervised method for reconstructing multi-person 3D human poses from 2D poses alone, addressing perspective ambiguity by predicting camera angles and aligning poses in a shared 3D space.
Contribution
It proposes a novel approach that predicts camera elevation angles to enable accurate multi-person 3D pose estimation from 2D data without supervision.
Findings
Achieved accurate 3D reconstructions on CHI3D dataset
Introduced three new quantitative metrics for evaluation
Established a benchmark for future unsupervised multi-person 3D pose estimation
Abstract
Current unsupervised 2D-3D human pose estimation (HPE) methods do not work in multi-person scenarios due to perspective ambiguity in monocular images. Therefore, we present one of the first studies investigating the feasibility of unsupervised multi-person 2D-3D HPE from just 2D poses alone, focusing on reconstructing human interactions. To address the issue of perspective ambiguity, we expand upon prior work by predicting the cameras' elevation angle relative to the subjects' pelvis. This allows us to rotate the predicted poses to be level with the ground plane, while obtaining an estimate for the vertical offset in 3D between individuals. Our method involves independently lifting each subject's 2D pose to 3D, before combining them in a shared 3D coordinate system. The poses are then rotated and offset by the predicted elevation angle before being scaled. This by itself enables us to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging
