GRIm-RePR: Prioritising Generating Important Features for Pseudo-Rehearsal

Craig Atkinson; Brendan McCane; Lech Szymanski; Anthony Robins

arXiv:1911.11988·cs.LG·November 28, 2019

TL;DR

This paper introduces GRIm-RePR, a pseudo-rehearsal method whose generator prioritises the features that matter for task retention, significantly reducing forgetting in deep reinforcement learning on Atari 2600 games.

Contribution

It proposes a novel generator training approach in which a second discriminator judges real versus generated items from the intermediate activations they produce in the continual learning agent, and it introduces Q-value normalization to further mitigate forgetting.

Findings

01. An improved generator reduces catastrophic forgetting in Atari reinforcement learning.

02. The second discriminator steers the generator toward reproducing the features that are important for task retention.

03. Q-value normalization decreases interference between tasks.

Abstract

Pseudo-rehearsal allows neural networks to learn a sequence of tasks without forgetting how to perform earlier tasks. Forgetting is prevented by introducing a generative network that produces data from previously seen tasks, so that this data can be rehearsed alongside learning the new task. This has been found to be effective in both supervised and reinforcement learning. Our current work aims to further prevent forgetting by encouraging the generator to accurately generate features important for task retention. More specifically, the generator is improved by introducing a second discriminator into the Generative Adversarial Network, which learns to distinguish real from fake items using the intermediate activation patterns that they produce when fed through a continual learning agent. Using Atari 2600 games, we experimentally find that improving the generator can…
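The two-discriminator idea in the abstract can be illustrated with a small NumPy sketch. This is not the paper's implementation: the one-layer stand-in agent, the logistic discriminators, the non-saturating GAN loss, and every name below (`agent_features`, `discriminator`, `generator_loss`, `feat_weight`) are assumptions chosen only to show how a generator's loss might combine a discriminator on raw items with a second discriminator on the agent's intermediate activations.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def agent_features(x, w_agent):
    """Intermediate activations of a stand-in agent (assumed: one ReLU layer)."""
    return np.maximum(x @ w_agent, 0.0)

def discriminator(inputs, w):
    """Logistic discriminator: probability that each row is a real item."""
    return sigmoid(inputs @ w)

def generator_loss(fake, w_agent, w_d_item, w_d_feat, feat_weight=1.0):
    """Non-saturating generator loss summed over both discriminators.

    The first term scores the generated items directly; the second scores
    the activations those items produce inside the agent. The loss form
    and the weighting are assumptions, not taken from the paper.
    """
    eps = 1e-8  # avoids log(0)
    p_item = discriminator(fake, w_d_item)
    p_feat = discriminator(agent_features(fake, w_agent), w_d_feat)
    loss_item = -np.mean(np.log(p_item + eps))
    loss_feat = -np.mean(np.log(p_feat + eps))
    return loss_item + feat_weight * loss_feat, loss_item, loss_feat

# Toy shapes: 4 generated items of dimension 16, 8 agent features.
item_dim, feat_dim, batch = 16, 8, 4
w_agent = rng.normal(size=(item_dim, feat_dim))   # frozen stand-in agent
w_d_item = rng.normal(size=item_dim)              # discriminator on items
w_d_feat = rng.normal(size=feat_dim)              # discriminator on features
fake = rng.normal(size=(batch, item_dim))         # stand-in generator output

total, loss_item, loss_feat = generator_loss(fake, w_agent, w_d_item, w_d_feat)
print(total, loss_item, loss_feat)
```

In this sketch, setting `feat_weight=0.0` recovers an ordinary single-discriminator generator loss, which makes the contribution of the feature-level term easy to isolate.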

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics · Model Reduction and Neural Networks