Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network
Chen Li, Gim Hee Lee

TL;DR
This paper introduces a novel mixture density network approach to generate multiple plausible 3D human pose hypotheses from 2D joint data, addressing depth ambiguity and occlusion issues in monocular pose estimation.
Contribution
It presents a multimodal mixture density network that produces multiple 3D pose hypotheses, outperforming unimodal methods and demonstrating strong generalization across datasets.
Findings
Achieves state-of-the-art results on Human3.6M dataset.
Produces multiple consistent 3D pose hypotheses.
Demonstrates good generalization on MPII and MPI-INF-3DHP datasets.
Abstract
3D human pose estimation from a monocular image or 2D joints is an ill-posed problem because of depth ambiguity and occluded joints. We argue that 3D human pose estimation from a monocular input is an inverse problem where multiple feasible solutions can exist. In this paper, we propose a novel approach to generate multiple feasible hypotheses of the 3D pose from 2D joints.In contrast to existing deep learning approaches which minimize a mean square error based on an unimodal Gaussian distribution, our method is able to generate multiple feasible hypotheses of 3D pose based on a multimodal mixture density networks. Our experiments show that the 3D poses estimated by our approach from an input of 2D joints are consistent in 2D reprojections, which supports our argument that multiple solutions exist for the 2D-to-3D inverse problem. Furthermore, we show state-of-the-art performance on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Diabetic Foot Ulcer Assessment and Management · Video Surveillance and Tracking Methods
