Simple Noisy Environment Augmentation for Reinforcement Learning

Raad Khraishi; Ramin Okhrati

arXiv:2305.02882 · cs.LG · May 5, 2023 · 1 citation

TL;DR

This paper introduces generic noise-based augmentation wrappers for reinforcement learning environments, intended to encourage exploration and increase training data diversity. The wrappers are evaluated experimentally across multiple algorithms and environments and are released as open-source tools.

Contribution

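The paper proposes a set of generic, noise-based augmentation wrappers for RL environments, covering states, rewards, and transition dynamics. These include two novel augmentation techniques and a noise rate hyperparameter that controls how often noise is injected. The wrappers are applicable to a broad range of RL algorithms and environments.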

Findings

01. Augmentation wrappers improve RL performance across algorithms.

02. Noise rate hyperparameter effectively controls noise injection frequency.

03. Experimental results demonstrate enhanced exploration and training stability.

Abstract

Data augmentation is a widely used technique for improving model performance in machine learning, particularly in computer vision and natural language processing. Recently, there has been increasing interest in applying augmentation techniques to reinforcement learning (RL) problems, with a focus on image-based augmentation. In this paper, we explore a set of generic wrappers that augment RL environments with noise in order to encourage agent exploration and improve training data diversity, and that are applicable to a broad spectrum of RL algorithms and environments. Specifically, we concentrate on augmentations concerning states, rewards, and transition dynamics, and we introduce two novel augmentation techniques. In addition, we introduce a noise rate hyperparameter that controls the frequency of noise injection. We present experimental results on the impact of these wrappers on return…
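
To make the idea of a noise wrapper with a noise rate concrete, here is a minimal sketch in plain Python. It is not the API of the authors' repository: `ToyEnv`, `NoisyObservationWrapper`, `noise_rate`, and `noise_scale` are hypothetical names chosen for illustration, and the choice of additive Gaussian noise on the state is an assumption. The sketch shows only one of the augmentation targets named in the abstract (states); rewards and transition dynamics could be handled with the same pattern.

```python
import random


class ToyEnv:
    """Hypothetical 1-D environment, included only so the sketch runs on its own."""

    def __init__(self):
        self.position = 0.0

    def reset(self):
        self.position = 0.0
        return [self.position]

    def step(self, action):
        self.position += float(action)
        reward = -abs(self.position)       # closer to the origin is better
        done = abs(self.position) >= 5.0   # episode ends far from the origin
        return [self.position], reward, done


class NoisyObservationWrapper:
    """Adds Gaussian noise to observations on a random fraction of calls.

    noise_rate is the probability that a given observation is perturbed;
    noise_scale is the standard deviation of the added noise. Both names
    are assumptions made for this sketch.
    """

    def __init__(self, env, noise_rate=0.1, noise_scale=0.05, seed=None):
        if not 0.0 <= noise_rate <= 1.0:
            raise ValueError("noise_rate must lie in [0, 1]")
        self.env = env
        self.noise_rate = noise_rate
        self.noise_scale = noise_scale
        self.rng = random.Random(seed)

    def _maybe_perturb(self, observation):
        # With probability 1 - noise_rate, pass the observation through unchanged.
        if self.rng.random() >= self.noise_rate:
            return list(observation)
        return [x + self.rng.gauss(0.0, self.noise_scale) for x in observation]

    def reset(self):
        return self._maybe_perturb(self.env.reset())

    def step(self, action):
        observation, reward, done = self.env.step(action)
        # Only the observation seen by the agent is perturbed; the underlying
        # environment state and the reward are left untouched.
        return self._maybe_perturb(observation), reward, done


if __name__ == "__main__":
    env = NoisyObservationWrapper(ToyEnv(), noise_rate=0.5, noise_scale=0.1, seed=0)
    print(env.reset())
    for _ in range(3):
        print(env.step(1.0))
```

Because the wrapper only intercepts `reset` and `step`, an agent written against the toy environment can be trained against the wrapped one without changes, which is the sense in which such wrappers are algorithm-agnostic. Setting `noise_rate` to 0 recovers the original environment, and setting it to 1 perturbs every observation.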

Code & Models

Repositories

ucl-ift/noisyenv (Official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Software Engineering Research · Evolutionary Algorithms and Applications

Methods: Convolution · Adam · Batch Normalization · Weight Decay · Experience Replay · Dense Connections · Deep Deterministic Policy Gradient