PoP-Net: Pose over Parts Network for Multi-Person 3D Pose Estimation from a Depth Image
Yuliang Guo, Zhong Li, Zekun Li, Xiangyu Du, Shuxue Quan, Yi Xu

TL;DR
PoP-Net is a real-time multi-person 3D pose estimation method from depth images that combines bottom-up part detection with top-down global pose estimation, introducing a new dataset for benchmarking.
Contribution
The paper introduces PoP-Net, a novel single-shot approach for multi-person 3D pose estimation from depth images, and presents the MP-3DHP dataset for training and evaluation.
Findings
Achieves state-of-the-art results on MP-3DHP and ITOP datasets.
Demonstrates efficiency advantages in multi-person processing.
Enables virtual avatar control from 3D joint positions.
Abstract
In this paper, a real-time method called PoP-Net is proposed to predict multi-person 3D poses from a depth image. PoP-Net learns to predict bottom-up part representations and top-down global poses in a single shot. Specifically, a new part-level representation, called Truncated Part Displacement Field (TPDF), is introduced which enables an explicit fusion process to unify the advantages of bottom-up part detection and global pose detection. Meanwhile, an effective mode selection scheme is introduced to automatically resolve the conflicting cases between global pose and part detections. Finally, due to the lack of high-quality depth datasets for developing multi-person 3D pose estimation, we introduce Multi-Person 3D Human Pose Dataset (MP-3DHP) as a new benchmark. MP-3DHP is designed to enable effective multi-person and background data augmentation in model training, and to evaluate 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
PoP-Net: Pose over Parts Network for Multi-Person 3D Pose Estimation from a Depth Image· youtube
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging
