Hyp-RL : Hyperparameter Optimization by Reinforcement Learning
Hadi S. Jomaa, Josif Grabocka, Lars Schmidt-Thieme

TL;DR
This paper introduces Hyp-RL, a reinforcement learning approach to hyperparameter optimization that learns to select hyperparameters sequentially, outperforming existing methods like Bayesian optimization across numerous datasets.
Contribution
The paper proposes modeling hyperparameter tuning as a sequential decision process and applying reinforcement learning, eliminating reliance on heuristic acquisition functions like in Bayesian optimization.
Findings
Hyp-RL outperforms state-of-the-art hyperparameter optimization methods.
The approach adapts to various datasets effectively.
Reinforcement learning improves hyperparameter selection efficiency.
Abstract
Hyperparameter tuning is an omnipresent problem in machine learning as it is an integral aspect of obtaining the state-of-the-art performance for any model. Most often, hyperparameters are optimized just by training a model on a grid of possible hyperparameter values and taking the one that performs best on a validation sample (grid search). More recently, methods have been introduced that build a so-called surrogate model that predicts the validation loss for a specific hyperparameter setting, model and dataset and then sequentially select the next hyperparameter to test, based on a heuristic function of the expected value and the uncertainty of the surrogate model called acquisition function (sequential model-based Bayesian optimization, SMBO). In this paper we model the hyperparameter optimization problem as a sequential decision problem, which hyperparameter to test next, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Bandit Algorithms Research · Data Stream Mining Techniques
