Understanding when Dynamics-Invariant Data Augmentations Benefit   Model-Free Reinforcement Learning Updates

Nicholas E. Corrado; Josiah P. Hanna

arXiv:2310.17786·cs.LG·March 19, 2024·1 cites

Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates

Nicholas E. Corrado, Josiah P. Hanna

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates when dynamics-invariant data augmentations improve data efficiency in model-free reinforcement learning, highlighting the importance of state-action coverage and the augmented replay ratio.

Contribution

It identifies key factors like state-action coverage and replay ratio that influence the effectiveness of data augmentation in RL, providing practical guidelines.

Findings

01

Increasing state-action coverage greatly enhances data efficiency.

02

Lowering the augmented replay ratio improves learning outcomes.

03

Some tasks require a low replay ratio to be solvable.

Abstract

Recently, data augmentation (DA) has emerged as a method for leveraging domain knowledge to inexpensively generate additional data in reinforcement learning (RL) tasks, often yielding substantial improvements in data efficiency. While prior work has demonstrated the utility of incorporating augmented data directly into model-free RL updates, it is not well-understood when a particular DA strategy will improve data efficiency. In this paper, we seek to identify general aspects of DA responsible for observed learning improvements. Our study focuses on sparse-reward tasks with dynamics-invariant data augmentation functions, serving as an initial step towards a more general understanding of DA and its integration into RL training. Experimentally, we isolate three relevant aspects of DA: state-action coverage, reward density, and the number of augmented transitions generated per update (the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

badger-rl/understandingdataaugmentationforrl
pytorchOfficial

Videos

Understanding when Dynamics-Invariant Data Augmentations Benefit Model-free Reinforcement Learning Updates· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics