An Empirical Study of World Model Quantization
Zhongqian Fu, Tianyi Zhao, Kai Han, Hang Zhou, Xinghao Chen, Yunhe Wang

TL;DR
This paper systematically investigates the effects of post-training quantization on world models, revealing unique failure modes and providing insights for efficient deployment in resource-constrained environments.
Contribution
It offers the first comprehensive empirical analysis of PTQ impacts on world models, highlighting quantization challenges and practical guidelines for deployment.
Findings
Group-wise weight quantization stabilizes low-bit rollouts
Activation quantization benefits are inconsistent
Aggressive low-bit quantization degrades planning success
Abstract
World models learn an internal representation of environment dynamics, enabling agents to simulate and reason about future states within a compact latent space for tasks such as planning, prediction, and inference. However, running world models rely on hevay computational cost and memory footprint, making model quantization essential for efficient deployment. To date, the effects of post-training quantization (PTQ) on world models remain largely unexamined. In this work, we present a systematic empirical study of world model quantization using DINO-WM as a representative case, evaluating diverse PTQ methods under both weight-only and joint weight-activation settings. We conduct extensive experiments on different visual planning tasks across a wide range of bit-widths, quantization granularities, and planning horizons up to 50 iterations. Our results show that quantization effects in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · AI-based Problem Solving and Planning
