Optimal-state Dynamics Estimation for Physics-based Human Motion Capture from Videos
Cuong Le, Viktor Johansson, Manon Kok, Bastian Wandt

TL;DR
This paper introduces a novel online physics-based human motion capture method that uses a neural Kalman filter to improve the accuracy and physical plausibility of estimated human poses from videos.
Contribution
It proposes a selective physics integration approach with a neural Kalman filter for improved online human motion capture from monocular videos.
Findings
Outperforms state-of-the-art methods in physics-based human pose estimation
Produces more physically plausible and smooth human motions
Effectively balances kinematic observations with physics simulation in real-time
Abstract
Human motion capture from monocular videos has made significant progress in recent years. However, modern approaches often produce temporal artifacts, e.g. in form of jittery motion and struggle to achieve smooth and physically plausible motions. Explicitly integrating physics, in form of internal forces and exterior torques, helps alleviating these artifacts. Current state-of-the-art approaches make use of an automatic PD controller to predict torques and reaction forces in order to re-simulate the input kinematics, i.e. the joint angles of a predefined skeleton. However, due to imperfect physical models, these methods often require simplifying assumptions and extensive preprocessing of the input kinematics to achieve good performance. To this end, we propose a novel method to selectively incorporate the physics models with the kinematics observations in an online setting, inspired by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Advanced Vision and Imaging
