Universal Successor Features Approximators

Diana Borsa; Andr\'e Barreto; John Quan; Daniel Mankowitz; R\'emi; Munos; Hado van Hasselt; David Silver; Tom Schaul

arXiv:1812.07626·cs.LG·December 20, 2018·24 cites

Universal Successor Features Approximators

Diana Borsa, Andr\'e Barreto, John Quan, Daniel Mankowitz, R\'emi, Munos, Hado van Hasselt, David Silver, Tom Schaul

PDF

Open Access 2 Repos

TL;DR

This paper introduces Universal Successor Features Approximators (USFAs), a method combining UVFAs, SFs, and GPI to improve generalisation and transfer in reinforcement learning, demonstrated in complex 3D navigation tasks.

Contribution

The paper proposes USFAs, a novel approach that integrates UVFAs, SFs, and GPI to enhance RL agents' ability to generalise to unseen tasks and transfer skills efficiently.

Findings

01

USFAs improve generalisation to unseen tasks.

02

USFAs demonstrate effective transfer in 3D navigation.

03

The method scales well to complex environments.

Abstract

The ability of a reinforcement learning (RL) agent to learn about many reward functions at the same time has many potential benefits, such as the decomposition of complex tasks into simpler ones, the exchange of information between tasks, and the reuse of skills. We focus on one aspect in particular, namely the ability to generalise to unseen tasks. Parametric generalisation relies on the interpolation power of a function approximator that is given the task description as input; one of its most common form are universal value function approximators (UVFAs). Another way to generalise to new tasks is to exploit structure in the RL problem itself. Generalised policy improvement (GPI) combines solutions of previous tasks into a policy for the unseen task; this relies on instantaneous policy evaluation of old policies under the new reward function, which is made possible through successor…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification