Hyperparameters in Contextual RL are Highly Situational

Theresa Eimer; Carolin Benjamins; Marius Lindauer

arXiv:2212.10876·cs.LG·December 22, 2022·1 cites

Hyperparameters in Contextual RL are Highly Situational

Theresa Eimer, Carolin Benjamins, Marius Lindauer

PDF

Open Access 1 Repo

TL;DR

This paper investigates how hyperparameters in contextual reinforcement learning are highly environment-specific, affecting stability and requiring different configurations based on environmental context, which complicates optimization.

Contribution

It demonstrates that hyperparameters in contextual RL depend heavily on environmental factors and that hyperparameter optimization varies in difficulty across different settings.

Findings

01

Hyperparameters vary with environmental context.

02

Optimizing hyperparameters is more challenging in contextual RL.

03

Environmental awareness improves hyperparameter tuning.

Abstract

Although Reinforcement Learning (RL) has shown impressive results in games and simulation, real-world application of RL suffers from its instability under changing environment conditions and hyperparameters. We give a first impression of the extent of this instability by showing that the hyperparameters found by automatic hyperparameter optimization (HPO) methods are not only dependent on the problem at hand, but even on how well the state describes the environment dynamics. Specifically, we show that agents in contextual RL require different hyperparameters if they are shown how environmental factors change. In addition, finding adequate hyperparameter configurations is not equally easy for both settings, further highlighting the need for research into how hyperparameters influence learning and generalization in RL.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

automl-private/crl_hpo
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSports Analytics and Performance · Artificial Intelligence in Games · Reinforcement Learning in Robotics