Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning
Zhecheng Yuan, Guozheng Ma, Yao Mu, Bo Xia, Bo Yuan, Xueqian Wang,, Ping Luo, Huazhe Xu

TL;DR
This paper introduces Task-aware Lipschitz Data Augmentation (TLDA), a novel method for visual reinforcement learning that selectively augments task-irrelevant pixels to improve generalization and sample efficiency.
Contribution
The paper proposes TLDA, which identifies task-relevant pixels using Lipschitz constants and applies augmentation only to irrelevant pixels, enhancing RL training stability and performance.
Findings
TLDA outperforms previous methods on DeepMind Control, CARLA, and DeepMind Manipulation tasks.
TLDA improves sample efficiency and generalization in visual RL.
Selective augmentation of task-irrelevant pixels stabilizes training and boosts performance.
Abstract
One of the key challenges in visual Reinforcement Learning (RL) is to learn policies that can generalize to unseen environments. Recently, data augmentation techniques aiming at enhancing data diversity have demonstrated proven performance in improving the generalization ability of learned policies. However, due to the sensitivity of RL training, naively applying data augmentation, which transforms each pixel in a task-agnostic manner, may suffer from instability and damage the sample efficiency, thus further exacerbating the generalization performance. At the heart of this phenomenon is the diverged action distribution and high-variance value estimation in the face of augmented images. To alleviate this issue, we propose Task-aware Lipschitz Data Augmentation (TLDA) for visual RL, which explicitly identifies the task-correlated pixels with large Lipschitz constants, and only augments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Domain Adaptation and Few-Shot Learning · Advanced Fluorescence Microscopy Techniques
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
