PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning
Hojoon Lee, Hanseul Cho, Hyunseung Kim, Daehoon Gwak, Joonkee Kim,, Jaegul Choo, Se-Young Yun, Chulhee Yun

TL;DR
This paper introduces the PLASTIC algorithm that enhances input and label plasticity in reinforcement learning, leading to improved sample efficiency by preventing overfitting and maintaining adaptability.
Contribution
The paper identifies two aspects of plasticity in RL, proposes techniques to improve them, and develops the PLASTIC algorithm that achieves competitive results on standard benchmarks.
Findings
Smoother loss minima improve input plasticity.
Refined gradient propagation enhances label plasticity.
PLASTIC achieves competitive performance on Atari-100k and DeepMind Control Suite.
Abstract
In Reinforcement Learning (RL), enhancing sample efficiency is crucial, particularly in scenarios when data acquisition is costly and risky. In principle, off-policy RL algorithms can improve sample efficiency by allowing multiple updates per environment interaction. However, these multiple updates often lead the model to overfit to earlier interactions, which is referred to as the loss of plasticity. Our study investigates the underlying causes of this phenomenon by dividing plasticity into two aspects. Input plasticity, which denotes the model's adaptability to changing input data, and label plasticity, which denotes the model's adaptability to evolving input-output relationships. Synthetic experiments on the CIFAR-10 dataset reveal that finding smoother minima of loss landscape enhances input plasticity, whereas refined gradient propagation improves label plasticity. Leveraging these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning
