Rethinking Plasticity in Deep Reinforcement Learning

Zhiqiang He

arXiv:2603.21173·cs.LG·March 24, 2026

Rethinking Plasticity in Deep Reinforcement Learning

Zhiqiang He

PDF

Open Access

TL;DR

This paper introduces the Optimization-Centric Plasticity hypothesis, explaining plasticity loss in deep RL as a result of local optima entrenchment, and demonstrates how this understanding can guide strategies to restore network adaptability in non-stationary environments.

Contribution

It proposes a new optimization-based framework for understanding plasticity loss, linking neuron dormancy to zero-gradient states and explaining the effectiveness of parameter constraints.

Findings

01

Plasticity loss is task-specific and reversible.

02

Neuron dormancy correlates with zero-gradient states.

03

Parameter constraints help prevent local optima entrenchment.

Abstract

This paper investigates the fundamental mechanisms driving plasticity loss in deep reinforcement learning (RL), a critical challenge where neural networks lose their ability to adapt to non-stationary environments. While existing research often relies on descriptive metrics like dormant neurons or effective rank, these summaries fail to explain the underlying optimization dynamics. We propose the Optimization-Centric Plasticity (OCP) hypothesis, which posits that plasticity loss arises because optimal points from previous tasks become poor local optima for new tasks, trapping parameters during task transitions and hindering subsequent learning. We theoretically establish the equivalence between neuron dormancy and zero-gradient states, demonstrating that the absence of gradient signals is the primary driver of dormancy. Our experiments reveal that plasticity loss is highly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural Networks and Reservoir Computing · Stochastic Gradient Optimization Techniques