A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning
Arthur Juliani, Jordan T. Ash

TL;DR
This paper investigates plasticity loss in on-policy deep reinforcement learning, revealing its prevalence under domain shift and demonstrating that regenerative methods effectively mitigate it across various challenging environments.
Contribution
It provides the first extensive experimental analysis of plasticity loss in on-policy deep RL and identifies regenerative methods as effective solutions.
Findings
Plasticity loss is widespread under domain shift in on-policy deep RL.
Many existing mitigation methods fail or worsen performance in this setting.
Regenerative methods consistently reduce plasticity loss across diverse tasks.
Abstract
Continual learning with deep neural networks presents challenges distinct from both the fixed-dataset and convex continual learning regimes. One such challenge is plasticity loss, wherein a neural network trained in an online fashion displays a degraded ability to fit new tasks. This problem has been extensively studied in both supervised learning and off-policy reinforcement learning (RL), where a number of remedies have been proposed. Still, plasticity loss has received less attention in the on-policy deep RL setting. Here we perform an extensive set of experiments examining plasticity loss and a variety of mitigation methods in on-policy deep RL. We demonstrate that plasticity loss is pervasive under domain shift in this regime, and that a number of methods developed to resolve it in other settings fail, sometimes even performing worse than applying no intervention at all. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsOpen Source Software Innovations
MethodsSparse Evolutionary Training
