A Recipe for Unbounded Data Augmentation in Visual Reinforcement   Learning

Abdulaziz Almuzairee; Nicklas Hansen; Henrik I. Christensen

arXiv:2405.17416·cs.LG·July 17, 2024·2 cites

A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning

Abdulaziz Almuzairee, Nicklas Hansen, Henrik I. Christensen

PDF

Open Access 1 Repo

TL;DR

This paper introduces SADA, a generalized data augmentation recipe for visual reinforcement learning that enhances training stability and generalization across diverse augmentation types, extending beyond photometric methods.

Contribution

Proposes SADA, a new data augmentation approach that broadens the types of augmentations effective for visual RL, improving stability and generalization.

Findings

01

SADA significantly improves training stability in visual RL.

02

SADA enhances generalization across various tasks and augmentations.

03

The method outperforms prior approaches like SVEA on benchmark tasks.

Abstract

Q-learning algorithms are appealing for real-world applications due to their data-efficiency, but they are very prone to overfitting and training instabilities when trained from visual observations. Prior work, namely SVEA, finds that selective application of data augmentation can improve the visual generalization of RL agents without destabilizing training. We revisit its recipe for data augmentation, and find an assumption that limits its effectiveness to augmentations of a photometric nature. Addressing these limitations, we propose a generalized recipe, SADA, that works with wider varieties of augmentations. We benchmark its effectiveness on DMC-GB2 - our proposed extension of the popular DMControl Generalization Benchmark - as well as tasks from Meta-World and the Distracting Control Suite, and find that our method, SADA, greatly improves training stability and generalization of RL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aalmuzairee/dmcgb2
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors

MethodsSparse Evolutionary Training