Pseudorehearsal in actor-critic agents

Marochko Vladimir; Leonard Johard; Manuel Mazzara

arXiv:1704.04912·cs.AI·April 18, 2017·2 cites

Pseudorehearsal in actor-critic agents

Marochko Vladimir, Leonard Johard, Manuel Mazzara

PDF

Open Access

TL;DR

This paper investigates whether pseudorehearsal techniques can mitigate catastrophic forgetting in actor-critic reinforcement learning agents, specifically in a pole balancing task, and compares different pseudorehearsal methods to enhance learning performance.

Contribution

It introduces the application of pseudorehearsal to actor-critic agents in reinforcement learning and compares various approaches to improve learning stability.

Findings

01

Pseudorehearsal can help prevent catastrophic forgetting in simple RL tasks.

02

Proper initialization of rehearsal parameters is crucial for effectiveness.

03

Different pseudorehearsal methods vary in performance.

Abstract

Catastrophic forgetting has a serious impact in reinforcement learning, as the data distribution is generally sparse and non-stationary over time. The purpose of this study is to investigate whether pseudorehearsal can increase performance of an actor-critic agent with neural-network based policy selection and function approximation in a pole balancing task and compare different pseudorehearsal approaches. We expect that pseudorehearsal assists learning even in such very simple problems, given proper initialization of the rehearsal parameters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Model Reduction and Neural Networks