Deep Autoencoder for Combined Human Pose Estimation and body Model Upscaling
Matthew Trumble, Andrew Gilbert, Adrian Hilton, John Collomosse

TL;DR
This paper introduces a deep autoencoder that simultaneously estimates 3D human pose and body shape from sparse camera views, upscales volumetric data, and operates in real-time for potential human behavior monitoring.
Contribution
It presents a novel symmetric convolutional autoencoder with dual loss for joint pose estimation and body shape upscaling from limited camera views.
Findings
Achieves real-time inference at 25 fps.
Upscales volumetric data by 4 times while maintaining accuracy.
Provides high-fidelity 3D human pose and shape estimation.
Abstract
We present a method for simultaneously estimating 3D human pose and body shape from a sparse set of wide-baseline camera views. We train a symmetric convolutional autoencoder with a dual loss that enforces learning of a latent representation that encodes skeletal joint positions, and at the same time learns a deep representation of volumetric body shape. We harness the latter to up-scale input volumetric data by a factor of , whilst recovering a 3D estimate of joint positions with equal or greater accuracy than the state of the art. Inference runs in real-time (25 fps) and has the potential for passive human behaviour monitoring where there is a requirement for high fidelity estimation of human body shape and pose.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Video Surveillance and Tracking Methods
MethodsSolana Customer Service Number +1-833-534-1729
