PLASTIC: Improving Input and Label Plasticity for Sample Efficient   Reinforcement Learning

Hojoon Lee; Hanseul Cho; Hyunseung Kim; Daehoon Gwak; Joonkee Kim,; Jaegul Choo; Se-Young Yun; Chulhee Yun

arXiv:2306.10711·cs.LG·December 13, 2023·1 cites

PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning

Hojoon Lee, Hanseul Cho, Hyunseung Kim, Daehoon Gwak, Joonkee Kim,, Jaegul Choo, Se-Young Yun, Chulhee Yun

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces the PLASTIC algorithm that enhances input and label plasticity in reinforcement learning, leading to improved sample efficiency by preventing overfitting and maintaining adaptability.

Contribution

The paper identifies two aspects of plasticity in RL, proposes techniques to improve them, and develops the PLASTIC algorithm that achieves competitive results on standard benchmarks.

Findings

01

Smoother loss minima improve input plasticity.

02

Refined gradient propagation enhances label plasticity.

03

PLASTIC achieves competitive performance on Atari-100k and DeepMind Control Suite.

Abstract

In Reinforcement Learning (RL), enhancing sample efficiency is crucial, particularly in scenarios when data acquisition is costly and risky. In principle, off-policy RL algorithms can improve sample efficiency by allowing multiple updates per environment interaction. However, these multiple updates often lead the model to overfit to earlier interactions, which is referred to as the loss of plasticity. Our study investigates the underlying causes of this phenomenon by dividing plasticity into two aspects. Input plasticity, which denotes the model's adaptability to changing input data, and label plasticity, which denotes the model's adaptability to evolving input-output relationships. Synthetic experiments on the CIFAR-10 dataset reveal that finding smoother minima of loss landscape enhances input plasticity, whereas refined gradient propagation improves label plasticity. Leveraging these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dojeon-ai/plastic
jaxOfficial

Videos

PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning