Accelerating Learning in Constructive Predictive Frameworks with the   Successor Representation

Craig Sherstan; Marlos C. Machado; Patrick M. Pilarski

arXiv:1803.09001·cs.LG·March 28, 2018

Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation

Craig Sherstan, Marlos C. Machado, Patrick M. Pilarski

PDF

TL;DR

This paper introduces the use of the successor representation (SR) to enhance learning speed and sample efficiency in constructive predictive frameworks based on general value functions (GVFs), especially in dynamic, real-world environments.

Contribution

The paper demonstrates that SR-based predictions can significantly accelerate learning and improve sample efficiency in incremental, continual learning scenarios involving GVFs.

Findings

01

SR improves learning speed in grid-world experiments.

02

SR enhances sample efficiency on a physical robot arm.

03

The approach supports incremental addition of predictions in dynamic environments.

Abstract

Here we propose using the successor representation (SR) to accelerate learning in a constructive knowledge system based on general value functions (GVFs). In real-world settings like robotics for unstructured and dynamic environments, it is infeasible to model all meaningful aspects of a system and its environment by hand due to both complexity and size. Instead, robots must be capable of learning and adapting to changes in their environment and task, incrementally constructing models from their own experience. GVFs, taken from the field of reinforcement learning (RL), are a way of modeling the world as predictive questions. One approach to such models proposes a massive network of interconnected and interdependent GVFs, which are incrementally added over time. It is reasonable to expect that new, incrementally added predictions can be learned more swiftly if the learning process…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.