Neural Monocular 3D Human Motion Capture with Physical Awareness
Soshi Shimada, Vladislav Golyanik, Weipeng Xu, Patrick, P\'erez, Christian Theobalt

TL;DR
This paper introduces Physionical, a neural system for markerless 3D human motion capture that incorporates physical constraints, achieving state-of-the-art results in challenging scenarios with physically plausible motions.
Contribution
It combines a neural controller, rigid body dynamics, and a novel optimization layer to enforce physical plausibility in 3D human motion capture from 2D keypoints.
Findings
Achieves state-of-the-art accuracy in challenging scenarios.
Produces smooth, physically consistent 3D motions in real-time.
Performs well on in-the-wild sequences unlike traditional benchmarks.
Abstract
We present a new trainable system for physically plausible markerless 3D human motion capture, which achieves state-of-the-art results in a broad range of challenging scenarios. Unlike most neural methods for human motion capture, our approach, which we dub physionical, is aware of physical and environmental constraints. It combines in a fully differentiable way several key innovations, i.e., 1. a proportional-derivative controller, with gains predicted by a neural network, that reduces delays even in the presence of fast motions, 2. an explicit rigid body dynamics model and 3. a novel optimisation layer that prevents physically implausible foot-floor penetration as a hard constraint. The inputs to our system are 2D joint keypoints, which are canonicalised in a novel way so as to reduce the dependency on intrinsic camera parameters -- both at train and test time. This enables more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Advanced Vision and Imaging
MethodsAttentive Walk-Aggregating Graph Neural Network
