Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for   Visual Reinforcement Learning

Zhecheng Yuan; Guozheng Ma; Yao Mu; Bo Xia; Bo Yuan; Xueqian Wang,; Ping Luo; Huazhe Xu

arXiv:2202.09982·cs.CV·February 23, 2022

Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning

Zhecheng Yuan, Guozheng Ma, Yao Mu, Bo Xia, Bo Yuan, Xueqian Wang,, Ping Luo, Huazhe Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces Task-aware Lipschitz Data Augmentation (TLDA), a novel method for visual reinforcement learning that selectively augments task-irrelevant pixels to improve generalization and sample efficiency.

Contribution

The paper proposes TLDA, which identifies task-relevant pixels using Lipschitz constants and applies augmentation only to irrelevant pixels, enhancing RL training stability and performance.

Findings

01

TLDA outperforms previous methods on DeepMind Control, CARLA, and DeepMind Manipulation tasks.

02

TLDA improves sample efficiency and generalization in visual RL.

03

Selective augmentation of task-irrelevant pixels stabilizes training and boosts performance.

Abstract

One of the key challenges in visual Reinforcement Learning (RL) is to learn policies that can generalize to unseen environments. Recently, data augmentation techniques aiming at enhancing data diversity have demonstrated proven performance in improving the generalization ability of learned policies. However, due to the sensitivity of RL training, naively applying data augmentation, which transforms each pixel in a task-agnostic manner, may suffer from instability and damage the sample efficiency, thus further exacerbating the generalization performance. At the heart of this phenomenon is the diverged action distribution and high-variance value estimation in the face of augmented images. To alleviate this issue, we propose Task-aware Lipschitz Data Augmentation (TLDA) for visual RL, which explicitly identifies the task-correlated pixels with large Lipschitz constants, and only augments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gemcollector/TLDA
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Domain Adaptation and Few-Shot Learning · Advanced Fluorescence Microscopy Techniques

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator