Latent Action World Models for Control with Unlabeled Trajectories

Marvin Alles; Xingyuan Zhang; Patrick van der Smagt; Philip Becker-Ehmck

arXiv:2512.10016·cs.LG·December 12, 2025

Latent Action World Models for Control with Unlabeled Trajectories

Marvin Alles, Xingyuan Zhang, Patrick van der Smagt, Philip Becker-Ehmck

PDF

Open Access

TL;DR

This paper introduces latent-action world models that effectively learn from both labeled and unlabeled data, improving control performance with fewer action-labeled samples by combining passive observations and active interactions.

Contribution

It proposes a novel latent-action representation that unifies action-conditioned and action-free data, enabling more efficient training of world models for control tasks.

Findings

01

Achieves strong performance on DeepMind Control Suite

02

Uses about ten times fewer action-labeled samples than baselines

03

Enables training on passive and interactive data simultaneously

Abstract

Inspired by how humans combine direct interaction with action-free experience (e.g., videos), we study world models that learn from heterogeneous data. Standard world models typically rely on action-conditioned trajectories, which limits effectiveness when action labels are scarce. We introduce a family of latent-action world models that jointly use action-conditioned and action-free data by learning a shared latent action representation. This latent space aligns observed control signals with actions inferred from passive observations, enabling a single dynamics model to train on large-scale unlabeled trajectories while requiring only a small set of action-labeled ones. We use the latent-action world model to learn a latent-action policy through offline reinforcement learning (RL), thereby bridging two traditionally separate domains: offline RL, which typically relies on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Reinforcement Learning in Robotics · Human Motion and Animation