PlayWorld: Learning Robot World Models from Autonomous Play

Tenny Yin; Zhiting Mei; Zhonghe Zheng; Miyu Yamane; David Wang; Jade Sceats; Samuel M. Bateman; Lihan Zha; Apurva Badithela; Ola Shorinwa; Anirudha Majumdar

arXiv:2603.09030·cs.RO·April 7, 2026

PlayWorld: Learning Robot World Models from Autonomous Play

Tenny Yin, Zhiting Mei, Zhonghe Zheng, Miyu Yamane, David Wang, Jade Sceats, Samuel M. Bateman, Lihan Zha, Apurva Badithela, Ola Shorinwa, Anirudha Majumdar

PDF

TL;DR

PlayWorld introduces an autonomous, self-play based pipeline for training high-fidelity, physically consistent robot world models that enhance manipulation, failure prediction, and reinforcement learning without human demonstrations.

Contribution

It is the first system capable of learning entirely from unsupervised robot self-play, enabling scalable data collection and modeling complex physical interactions.

Findings

01

High-quality, physically consistent predictions for contact-rich interactions.

02

Up to 40% improvement in failure prediction and policy evaluation.

03

65% increase in real-world policy success rates.

Abstract

Action-conditioned video models offer a promising path to building general-purpose robot simulators that can improve directly from data. Yet, despite training on large-scale robot datasets, current state-of-the-art video models still struggle to predict physically consistent robot-object interactions that are crucial in robotic manipulation. To close this gap, we present PlayWorld, a simple, scalable, and fully autonomous pipeline for training high-fidelity video world simulators from interaction experience. In contrast to prior approaches that rely on success-biased human demonstrations, PlayWorld is the first system capable of learning entirely from unsupervised robot self-play, enabling naturally scalable data collection while capturing complex, long-tailed physical interactions essential for modeling realistic object dynamics. Experiments across diverse manipulation tasks show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.