Invariant Transform Experience Replay: Data Augmentation for Deep   Reinforcement Learning

Yijiong Lin; Jiancong Huang; Matthieu Zimmer; Yisheng Guan; Juan; Rojas; Paul Weng

arXiv:1909.10707·cs.RO·August 27, 2020

Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning

Yijiong Lin, Jiancong Huang, Matthieu Zimmer, Yisheng Guan, Juan, Rojas, Paul Weng

PDF

1 Repo

TL;DR

This paper introduces Invariant Transform Experience Replay, a data augmentation framework exploiting task symmetries to improve learning efficiency in deep reinforcement learning for robotics.

Contribution

It presents a general framework with two techniques, Kaleidoscope and Goal-augmented Experience Replay, to leverage symmetries for faster RL training.

Findings

01

Significant speedups in learning rates and success rates in robotic tasks.

02

Achieved up to 13x speedup in certain tasks.

03

Successfully deployed policies on real robots.

Abstract

Deep Reinforcement Learning (RL) is a promising approach for adaptive robot control, but its current application to robotics is currently hindered by high sample requirements. To alleviate this issue, we propose to exploit the symmetries present in robotic tasks. Intuitively, symmetries from observed trajectories define transformations that leave the space of feasible RL trajectories invariant and can be used to generate new feasible trajectories, which could be used for training. Based on this data augmentation idea, we formulate a general framework, called Invariant Transform Experience Replay that we present with two techniques: (i) Kaleidoscope Experience Replay exploits reflectional symmetries and (ii) Goal-augmented Experience Replay which takes advantage of lax goal definitions. In the Fetch tasks from OpenAI Gym, our experimental results show significant increases in learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YijiongLin/ITER_KER_GER
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Experience Replay