A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning
Jacob Adkins, Michael Bowling, Adam White

TL;DR
This paper introduces an empirical methodology to evaluate how sensitive reinforcement learning algorithms are to hyperparameter tuning across different environments, highlighting the importance of hyperparameter robustness.
Contribution
The paper proposes a scalable, empirical approach for quantifying hyperparameter sensitivity in reinforcement learning algorithms across multiple environments.
Findings
Normalization variants of PPO show varying hyperparameter sensitivity.
Some performance improvements are due to increased hyperparameter tuning reliance.
The methodology enables better understanding of hyperparameter interactions.
Abstract
The performance of modern reinforcement learning algorithms critically relies on tuning ever-increasing numbers of hyperparameters. Often, small changes in a hyperparameter can lead to drastic changes in performance, and different environments require very different hyperparameter settings to achieve state-of-the-art performance reported in the literature. We currently lack a scalable and widely accepted approach to characterizing these complex interactions. This work proposes a new empirical methodology for studying, comparing, and quantifying the sensitivity of an algorithm's performance to hyperparameter tuning for a given set of environments. We then demonstrate the utility of this methodology by assessing the hyperparameter sensitivity of several commonly used normalization variants of PPO. The results suggest that several algorithmic performance improvements may, in fact, be a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsEvolutionary Algorithms and Applications
MethodsEntropy Regularization · Proximal Policy Optimization · Sparse Evolutionary Training
