Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules   and Training Stages

Guozheng Ma; Lu Li; Sen Zhang; Zixuan Liu; Zhen Wang; Yixin Chen; Li; Shen; Xueqian Wang; Dacheng Tao

arXiv:2310.07418·cs.LG·May 21, 2024·1 cites

Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages

Guozheng Ma, Lu Li, Sen Zhang, Zixuan Liu, Zhen Wang, Yixin Chen, Li, Shen, Xueqian Wang, Dacheng Tao

PDF

Open Access 1 Repo

TL;DR

This paper investigates the factors affecting plasticity in visual reinforcement learning, highlighting the importance of data augmentation and critic plasticity, and introduces an adaptive replay ratio strategy to improve sample efficiency.

Contribution

It provides a systematic empirical analysis of plasticity in VRL and proposes an adaptive replay ratio method to enhance training efficiency.

Findings

01

Data augmentation is crucial for maintaining plasticity.

02

Critic's plasticity loss is the main bottleneck in training.

03

Adaptive replay ratio improves sample efficiency and prevents catastrophic plasticity loss.

Abstract

Plasticity, the ability of a neural network to evolve with new data, is crucial for high-performance and sample-efficient visual reinforcement learning (VRL). Although methods like resetting and regularization can potentially mitigate plasticity loss, the influences of various components within the VRL framework on the agent's plasticity are still poorly understood. In this work, we conduct a systematic empirical exploration focusing on three primary underexplored facets and derive the following insightful conclusions: (1) data augmentation is essential in maintaining plasticity; (2) the critic's plasticity loss serves as the principal bottleneck impeding efficient training; and (3) without timely intervention to recover critic's plasticity in the early stages, its loss becomes catastrophic. These insights suggest a novel strategy to address the high replay ratio (RR) dilemma, where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Guozheng-Ma/Adaptive-Replay-Ratio
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · Neural Networks and Reservoir Computing