The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson, Samuel Neumann, Raksha Kumaraswamy, Martha White,, Adam White

TL;DR
This paper proposes a new benchmark for evaluating reinforcement learning algorithms across multiple environments using a single hyperparameter setting, emphasizing robustness and low computational cost, and demonstrates its effectiveness through empirical studies.
Contribution
The paper introduces the Cross-environment Hyperparameter Setting Benchmark, a novel empirical methodology for robust, low-cost evaluation of RL algorithms across diverse environments.
Findings
The benchmark is robust to statistical noise and consistent across repeated tests.
It is computationally inexpensive, enabling low-cost, statistically sound insights.
Empirical study shows no significant difference between noise types in DDPG exploration.
Abstract
This paper introduces a new empirical methodology, the Cross-environment Hyperparameter Setting Benchmark, that compares RL algorithms across environments using a single hyperparameter setting, encouraging algorithmic development which is insensitive to hyperparameters. We demonstrate that this benchmark is robust to statistical noise and obtains qualitatively similar results across repeated applications, even when using few samples. This robustness makes the benchmark computationally cheap to apply, allowing statistically sound insights at low cost. We demonstrate two example instantiations of the CHS, on a set of six small control environments (SC-CHS) and on the entire DM Control suite of 28 environments (DMC-CHS). Finally, to illustrate the applicability of the CHS to modern RL algorithms on challenging environments, we conduct a novel empirical study of an open question in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Reinforcement Learning in Robotics · Data Stream Mining Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Sparse Evolutionary Training · Dense Connections · Weight Decay · Batch Normalization · Adam · Convolution · Experience Replay · Deep Deterministic Policy Gradient
