Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Guozheng Ma, Lu Li, Sen Zhang, Zixuan Liu, Zhen Wang, Yixin Chen, Li, Shen, Xueqian Wang, Dacheng Tao

TL;DR
This paper investigates the factors affecting plasticity in visual reinforcement learning, highlighting the importance of data augmentation and critic plasticity, and introduces an adaptive replay ratio strategy to improve sample efficiency.
Contribution
It provides a systematic empirical analysis of plasticity in VRL and proposes an adaptive replay ratio method to enhance training efficiency.
Findings
Data augmentation is crucial for maintaining plasticity.
Critic's plasticity loss is the main bottleneck in training.
Adaptive replay ratio improves sample efficiency and prevents catastrophic plasticity loss.
Abstract
Plasticity, the ability of a neural network to evolve with new data, is crucial for high-performance and sample-efficient visual reinforcement learning (VRL). Although methods like resetting and regularization can potentially mitigate plasticity loss, the influences of various components within the VRL framework on the agent's plasticity are still poorly understood. In this work, we conduct a systematic empirical exploration focusing on three primary underexplored facets and derive the following insightful conclusions: (1) data augmentation is essential in maintaining plasticity; (2) the critic's plasticity loss serves as the principal bottleneck impeding efficient training; and (3) without timely intervention to recover critic's plasticity in the early stages, its loss becomes catastrophic. These insights suggest a novel strategy to address the high replay ratio (RR) dilemma, where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · Neural Networks and Reservoir Computing
