Task Relabelling for Multi-task Transfer using Successor Features
Martin Balla, Diego Perez-Liebana

TL;DR
This paper explores pre-training Successor Features in a resource collection environment without rewards, enabling zero-shot transfer to new tasks via task relabeling, which significantly enhances transfer performance.
Contribution
It introduces a novel task relabeling method for pre-training Successor Features, improving zero-shot transfer capabilities in reinforcement learning environments.
Findings
Pre-trained SFs transfer effectively to new tasks without additional training.
Task relabeling significantly improves transfer performance.
SFs can be pre-trained without reward signals in complex environments.
Abstract
Deep Reinforcement Learning has been very successful recently with various works on complex domains. Most works are concerned with learning a single policy that solves the target task, but is fixed in the sense that if the environment changes the agent is unable to adapt to it. Successor Features (SFs) proposes a mechanism that allows learning policies that are not tied to any particular reward function. In this work we investigate how SFs may be pre-trained without observing any reward in a custom environment that features resource collection, traps and crafting. After pre-training we expose the SF agents to various target tasks and see how well they can transfer to new tasks. Transferring is done without any further training on the SF agents, instead just by providing a task vector. For training the SFs we propose a task relabelling method which greatly improves the agent's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Data Stream Mining Techniques
