HaM-World: Soft-Hamiltonian World Models with Selective Memory for Planning
Haoyun Tang, Haodong Cui, Keyao Xu, Kun Wang, Zhandong Mei

TL;DR
HaM-World introduces a structured world model with selective memory and Hamiltonian dynamics to improve long-horizon planning and out-of-distribution robustness in model-based RL.
Contribution
It proposes a novel structured latent space combining Hamiltonian dynamics and selective memory, enhancing stability and generalization in world models.
Findings
Achieves highest Avg. AUC on four DeepMind Control Suite tasks.
Reduces long-horizon rollout error to 45% of baseline.
Outperforms in out-of-distribution scenarios with significant return gains.
Abstract
World models enable model-based planning through learned latent dynamics, but imagined rollouts become unstable as the planning horizon grows or the dynamics distribution shifts. We argue that this instability reflects two missing structures in planner-facing latents: history-conditioned memory for approximate Markov completeness, and geometric organization that separates configuration, momentum, and task semantics. We propose HaM-World (HMW), a structured world model that decomposes the latent state into a canonical (q, p) subspace and a context subspace c, while using Mamba selective state-space memory as the history-conditioned input to the same latent dynamics. Within this interface, (q, p) evolves through an energy-derived Hamiltonian vector field plus learnable residual/control dynamics, while c captures semantic, dissipative, and non-conservative factors. This gives the planner a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
