Semantic Belief-State World Model for 3D Human Motion Prediction

Sarim Chaudhry

arXiv:2601.03517·cs.CV·January 8, 2026

Semantic Belief-State World Model for 3D Human Motion Prediction

Sarim Chaudhry

PDF

Open Access

TL;DR

This paper introduces SBWM, a probabilistic latent dynamical model for 3D human motion prediction that improves long-term stability and interpretability by explicitly modeling motion dynamics and intent on the human body manifold.

Contribution

The paper proposes a novel belief-state world model that separates motion dynamics from pose reconstruction, leading to more stable long-term human motion predictions.

Findings

01

Coherent long-horizon rollouts achieved

02

Competitive accuracy with lower computational cost

03

Explicit modeling of motion dynamics improves stability

Abstract

Human motion prediction has traditionally been framed as a sequence regression problem where models extrapolate future joint coordinates from observed pose histories. While effective over short horizons this approach does not separate observation reconstruction with dynamics modeling and offers no explicit representation of the latent causes governing motion. As a result, existing methods exhibit compounding drift, mean-pose collapse, and poorly calibrated uncertainty when rolled forward beyond the training regime. Here we propose a Semantic Belief-State World Model (SBWM) that reframes human motion prediction as latent dynamical simulation on the human body manifold. Rather than predicting poses directly, SBWM maintains a recurrent probabilistic belief state whose evolution is learned independently of pose reconstruction and explicitly aligned with the SMPL-X anatomical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Human Motion and Animation · Gait Recognition and Analysis