GRIm-RePR: Prioritising Generating Important Features for Pseudo-Rehearsal
Craig Atkinson, Brendan McCane, Lech Szymanski, Anthony Robins

TL;DR
This paper introduces GRIm-RePR, a method that enhances pseudo-rehearsal by prioritizing important features for task retention, significantly reducing forgetting in deep reinforcement learning on Atari games.
Contribution
It proposes a novel generator training approach with a second discriminator focusing on feature importance, and introduces Q-value normalization to further mitigate forgetting.
Findings
Improved generator reduces catastrophic forgetting in Atari reinforcement learning.
Second discriminator enhances feature importance in generated data.
Q-value normalization decreases interference between tasks.
Abstract
Pseudo-rehearsal allows neural networks to learn a sequence of tasks without forgetting how to perform in earlier tasks. Preventing forgetting is achieved by introducing a generative network which can produce data from previously seen tasks so that it can be rehearsed along side learning the new task. This has been found to be effective in both supervised and reinforcement learning. Our current work aims to further prevent forgetting by encouraging the generator to accurately generate features important for task retention. More specifically, the generator is improved by introducing a second discriminator into the Generative Adversarial Network which learns to classify between real and fake items from the intermediate activation patterns that they produce when fed through a continual learning agent. Using Atari 2600 games, we experimentally find that improving the generator can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics · Model Reduction and Neural Networks
