MonoMSK: Monocular 3D Musculoskeletal Dynamics Estimation
Farnoosh Koleini, Hongfei Xue, Ahmed Helmy, Pu Wang

TL;DR
MonoMSK is a hybrid framework that combines data-driven learning and physics-based simulation to achieve biomechanically realistic 3D human motion reconstruction, including both kinematics and kinetics, from monocular video.
Contribution
It introduces a novel physics-regulated inverse-forward loop with a differentiable simulation and a forward-inverse consistency loss for improved biomechanical fidelity.
Findings
Outperforms state-of-the-art in kinematic accuracy
Enables precise monocular kinetics estimation
Demonstrates effectiveness on multiple datasets
Abstract
Reconstructing biomechanically realistic 3D human motion - recovering both kinematics (motion) and kinetics (forces) - is a critical challenge. While marker-based systems are lab-bound and slow, popular monocular methods use oversimplified, anatomically inaccurate models (e.g., SMPL) and ignore physics, fundamentally limiting their biomechanical fidelity. In this work, we introduce MonoMSK, a hybrid framework that bridges data-driven learning and physics-based simulation for biomechanically realistic 3D human motion estimation from monocular video. MonoMSK jointly recovers both kinematics (motions) and kinetics (forces and torques) through an anatomically accurate musculoskeletal model. By integrating transformer-based inverse dynamics with differentiable forward kinematics and dynamics layers governed by ODE-based simulation, MonoMSK establishes a physics-regulated inverse-forward loop…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Prosthetics and Rehabilitation Robotics
