Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates
Nicholas E. Corrado, Josiah P. Hanna

TL;DR
This paper investigates when dynamics-invariant data augmentations improve data efficiency in model-free reinforcement learning, highlighting the importance of state-action coverage and the augmented replay ratio.
Contribution
It identifies key factors like state-action coverage and replay ratio that influence the effectiveness of data augmentation in RL, providing practical guidelines.
Findings
Increasing state-action coverage greatly enhances data efficiency.
Lowering the augmented replay ratio improves learning outcomes.
Some tasks require a low replay ratio to be solvable.
Abstract
Recently, data augmentation (DA) has emerged as a method for leveraging domain knowledge to inexpensively generate additional data in reinforcement learning (RL) tasks, often yielding substantial improvements in data efficiency. While prior work has demonstrated the utility of incorporating augmented data directly into model-free RL updates, it is not well-understood when a particular DA strategy will improve data efficiency. In this paper, we seek to identify general aspects of DA responsible for observed learning improvements. Our study focuses on sparse-reward tasks with dynamics-invariant data augmentation functions, serving as an initial step towards a more general understanding of DA and its integration into RL training. Experimentally, we isolate three relevant aspects of DA: state-action coverage, reward density, and the number of augmented transitions generated per update (the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics
