Monocular Real-time Full Body Capture with Inter-part Correlations
Yuxiao Zhou, Marc Habermann, Ikhsanul Habibie, Ayush Tewari, Christian, Theobalt, Feng Xu

TL;DR
This paper introduces a real-time monocular full body capture method that jointly estimates body, hand, and face models from a single image, leveraging multi-dataset training for better generalization and detailed face reconstruction.
Contribution
It presents a novel neural network architecture that exploits inter-part correlations for efficient, joint full body, hand, and face capture from monocular images, trained on separate datasets without requiring all annotations.
Findings
Achieves competitive accuracy on public benchmarks.
Provides more expressive 3D face geometry and color.
Operates significantly faster than previous methods.
Abstract
We present the first method for real-time full body capture that estimates shape and motion of body and hands together with a dynamic 3D face model from a single color image. Our approach uses a new neural network architecture that exploits correlations between body and hands at high computational efficiency. Unlike previous works, our approach is jointly trained on multiple datasets focusing on hand, body or face separately, without requiring data where all the parts are annotated at the same time, which is much more difficult to create at sufficient variety. The possibility of such multi-dataset training enables superior generalization ability. In contrast to earlier monocular full body methods, our approach captures more expressive 3D face geometry and color by estimating the shape, expression, albedo and illumination parameters of a statistical face model. Our method achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Human Pose and Action Recognition · Video Surveillance and Tracking Methods
