Semantic Belief-State World Model for 3D Human Motion Prediction
Sarim Chaudhry

TL;DR
This paper introduces SBWM, a probabilistic latent dynamical model for 3D human motion prediction that improves long-term stability and interpretability by explicitly modeling motion dynamics and intent on the human body manifold.
Contribution
The paper proposes a novel belief-state world model that separates motion dynamics from pose reconstruction, leading to more stable long-term human motion predictions.
Findings
Coherent long-horizon rollouts achieved
Competitive accuracy with lower computational cost
Explicit modeling of motion dynamics improves stability
Abstract
Human motion prediction has traditionally been framed as a sequence regression problem where models extrapolate future joint coordinates from observed pose histories. While effective over short horizons this approach does not separate observation reconstruction with dynamics modeling and offers no explicit representation of the latent causes governing motion. As a result, existing methods exhibit compounding drift, mean-pose collapse, and poorly calibrated uncertainty when rolled forward beyond the training regime. Here we propose a Semantic Belief-State World Model (SBWM) that reframes human motion prediction as latent dynamical simulation on the human body manifold. Rather than predicting poses directly, SBWM maintains a recurrent probabilistic belief state whose evolution is learned independently of pose reconstruction and explicitly aligned with the SMPL-X anatomical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Gait Recognition and Analysis
