PoP-Net: Pose over Parts Network for Multi-Person 3D Pose Estimation   from a Depth Image

Yuliang Guo; Zhong Li; Zekun Li; Xiangyu Du; Shuxue Quan; Yi Xu

arXiv:2012.06734·cs.CV·November 29, 2021

PoP-Net: Pose over Parts Network for Multi-Person 3D Pose Estimation from a Depth Image

Yuliang Guo, Zhong Li, Zekun Li, Xiangyu Du, Shuxue Quan, Yi Xu

PDF

Open Access 1 Repo 1 Video

TL;DR

PoP-Net is a real-time multi-person 3D pose estimation method from depth images that combines bottom-up part detection with top-down global pose estimation, introducing a new dataset for benchmarking.

Contribution

The paper introduces PoP-Net, a novel single-shot approach for multi-person 3D pose estimation from depth images, and presents the MP-3DHP dataset for training and evaluation.

Findings

01

Achieves state-of-the-art results on MP-3DHP and ITOP datasets.

02

Demonstrates efficiency advantages in multi-person processing.

03

Enables virtual avatar control from 3D joint positions.

Abstract

In this paper, a real-time method called PoP-Net is proposed to predict multi-person 3D poses from a depth image. PoP-Net learns to predict bottom-up part representations and top-down global poses in a single shot. Specifically, a new part-level representation, called Truncated Part Displacement Field (TPDF), is introduced which enables an explicit fusion process to unify the advantages of bottom-up part detection and global pose detection. Meanwhile, an effective mode selection scheme is introduced to automatically resolve the conflicting cases between global pose and part detections. Finally, due to the lack of high-quality depth datasets for developing multi-person 3D pose estimation, we introduce Multi-Person 3D Human Pose Dataset (MP-3DHP) as a new benchmark. MP-3DHP is designed to enable effective multi-person and background data augmentation in model training, and to evaluate 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oppo-us-research/PoP-Net
pytorchOfficial

Videos

PoP-Net: Pose over Parts Network for Multi-Person 3D Pose Estimation from a Depth Image· youtube

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging