Forward-Backward Reinforcement Learning

Ashley D. Edwards; Laura Downs; James C. Davidson

arXiv:1803.10227·cs.LG·March 29, 2018·20 cites

Forward-Backward Reinforcement Learning

Ashley D. Edwards, Laura Downs, James C. Davidson

PDF

Open Access

TL;DR

This paper introduces a backward reinforcement learning approach that leverages goal states to improve training efficiency by predicting reverse steps, demonstrated to outperform standard methods in Gridworld and Towers of Hanoi.

Contribution

The paper proposes a novel backward reinforcement learning method that uses imagined reversal steps from goal states to accelerate training.

Findings

01

Better performance than standard DDQN in experiments

02

Effective in Gridworld and Towers of Hanoi environments

03

Leverages goal states to improve learning efficiency

Abstract

Goals for reinforcement learning problems are typically defined through hand-specified rewards. To design such problems, developers of learning algorithms must inherently be aware of what the task goals are, yet we often require agents to discover them on their own without any supervision beyond these sparse rewards. While much of the power of reinforcement learning derives from the concept that agents can learn with little guidance, this requirement greatly burdens the training process. If we relax this one restriction and endow the agent with knowledge of the reward function, and in particular of the goal, we can leverage backwards induction to accelerate training. To achieve this, we propose training a model to learn to take imagined reversal steps from known goal states. Rather than training an agent exclusively to determine how to reach a goal while moving forwards in time, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Advanced Bandit Algorithms Research