Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics

Jashaswimalya Acharjee; Balaraman Ravindran

arXiv:2602.12643·cs.LG·February 16, 2026

Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics

Jashaswimalya Acharjee, Balaraman Ravindran

PDF

Open Access

TL;DR

Unified Latent Dynamics (ULD) is a reinforcement learning method that combines model-free efficiency with model-based representational strengths using latent space embeddings, achieving high performance across diverse domains without planning overhead.

Contribution

The paper introduces ULD, a novel RL algorithm that unifies model-free and model-based approaches through latent space embeddings, with theoretical guarantees and broad empirical success.

Findings

01

Matches or exceeds specialized baselines in 80 environments

02

Supports a single hyperparameter set across diverse tasks

03

Achieves cross-domain competence with minimal tuning

Abstract

We present Unified Latent Dynamics (ULD), a novel reinforcement learning algorithm that unifies the efficiency of model-free methods with the representational strengths of model-based approaches, without incurring planning overhead. By embedding state-action pairs into a latent space in which the true value function is approximately linear, our method supports a single set of hyperparameters across diverse domains -- from continuous control with low-dimensional and pixel inputs to high-dimensional Atari games. We prove that, under mild conditions, the fixed point of our embedding-based temporal-difference updates coincides with that of a corresponding linear model-based value expansion, and we derive explicit error bounds relating embedding fidelity to value approximation quality. In practice, ULD employs synchronized updates of encoder, value, and policy networks, auxiliary losses for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Artificial Intelligence in Games