World Value Functions: Knowledge Representation for Learning and Planning
Geraud Nangue Tasse, Benjamin Rosman, Steven James

TL;DR
World Value Functions (WVFs) let an agent learn how to reach every goal in its environment rather than only solve the current task. The learned representation supports rapid adaptation to new tasks and implicitly models the environment's dynamics, which the agent can use for planning.
Contribution
The paper introduces WVFs, a type of goal-oriented general value function that represents how to solve any goal-reaching task in the agent's environment, implicitly encodes the environment's transition dynamics, and can be used for planning.
Findings
Given a learned WVF, an agent can compute the optimal policy for a new task by estimating only that task's reward function.
WVFs implicitly encode environment transition dynamics.
Using WVFs improves sample efficiency in learning and planning.
Abstract
We propose world value functions (WVFs), a type of goal-oriented general value function that represents how to solve not just a given task, but any other goal-reaching task in an agent's environment. This is achieved by equipping an agent with an internal goal space defined as all the world states where it experiences a terminal transition. The agent can then modify the standard task rewards to define its own reward function, which provably drives it to learn how to achieve all reachable internal goals, and the value of doing so in the current task. We demonstrate two key benefits of WVFs in the context of learning and planning. In particular, given a learned WVF, an agent can compute the optimal policy in a new task by simply estimating the task's reward function. Furthermore, we show that WVFs also implicitly encode the transition dynamics of the environment, and so can be used to…
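The mechanism described in the abstract can be illustrated with a minimal tabular sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the five-state chain, the reward values, the penalty `R_MIN`, the exhaustive sweeps with learning rate 1, and the helper names `step`, `learn_wvf`, `task_action` and `goal_action`. The sketch assumes that the modified reward replaces the task reward with a large penalty whenever the agent terminates in a state other than the goal it is currently pursuing, and that the task policy is recovered by maximising over goals.

```python
from collections import defaultdict

# Hypothetical toy environment: a chain of states 0..4 with terminal ends.
N_STATES = 5
TERMINAL = {0, 4}        # entering either state ends the episode
ACTIONS = (-1, +1)       # move left, move right
GAMMA = 1.0              # undiscounted; the step cost keeps values bounded
STEP_REWARD = -0.1
R_MIN = -10.0            # assumed penalty, lower than any return in this task


def step(s, a, terminal_rewards):
    """Deterministic transition; the task reward is paid on entering a terminal state."""
    s_next = s + a
    done = s_next in TERMINAL
    r = STEP_REWARD + (terminal_rewards[s_next] if done else 0.0)
    return s_next, r, done


def learn_wvf(terminal_rewards, sweeps=50):
    """Learn q[(state, goal, action)] for every goal the agent discovers."""
    q = defaultdict(float)
    goals = set()        # internal goal space: states seen in terminal transitions
    for _ in range(sweeps):
        for s in range(N_STATES):
            if s in TERMINAL:
                continue
            for a in ACTIONS:
                s_next, r, done = step(s, a, terminal_rewards)
                if done:
                    goals.add(s_next)
                for g in goals:
                    if done:
                        # Modified reward: penalise ending anywhere except goal g.
                        target = r if s_next == g else R_MIN
                    else:
                        target = r + GAMMA * max(q[(s_next, g, b)] for b in ACTIONS)
                    # Learning rate 1 suffices because the toy dynamics are deterministic.
                    q[(s, g, a)] = target
    return q, goals


def task_action(q, goals, s):
    """Greedy action for the current task: maximise over goals, then actions."""
    return max(ACTIONS, key=lambda a: max(q[(s, g, a)] for g in goals))


def goal_action(q, g, s):
    """Greedy action for reaching one specific goal g."""
    return max(ACTIONS, key=lambda a: q[(s, g, a)])


q, goals = learn_wvf({0: 0.0, 4: 1.0})
print(sorted(goals))
print([task_action(q, goals, s) for s in (1, 2, 3)])
print([goal_action(q, 0, s) for s in (1, 2, 3)])
```

In this sketch a single table answers two questions at once: `goal_action` says how to reach any discovered goal, while `task_action` says which goal is worth reaching under the current task's rewards.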
Taxonomy
Topics: Reinforcement Learning in Robotics · Bayesian Modeling and Causal Inference · Machine Learning in Healthcare
