Hyp-RL : Hyperparameter Optimization by Reinforcement Learning

Hadi S. Jomaa; Josif Grabocka; Lars Schmidt-Thieme

arXiv:1906.11527·cs.LG·June 28, 2019·34 cites

Hyp-RL : Hyperparameter Optimization by Reinforcement Learning

Hadi S. Jomaa, Josif Grabocka, Lars Schmidt-Thieme

PDF

Open Access 1 Repo

TL;DR

This paper introduces Hyp-RL, a reinforcement learning approach to hyperparameter optimization that learns to select hyperparameters sequentially, outperforming existing methods like Bayesian optimization across numerous datasets.

Contribution

The paper proposes modeling hyperparameter tuning as a sequential decision process and applying reinforcement learning, eliminating reliance on heuristic acquisition functions like in Bayesian optimization.

Findings

01

Hyp-RL outperforms state-of-the-art hyperparameter optimization methods.

02

The approach adapts to various datasets effectively.

03

Reinforcement learning improves hyperparameter selection efficiency.

Abstract

Hyperparameter tuning is an omnipresent problem in machine learning as it is an integral aspect of obtaining the state-of-the-art performance for any model. Most often, hyperparameters are optimized just by training a model on a grid of possible hyperparameter values and taking the one that performs best on a validation sample (grid search). More recently, methods have been introduced that build a so-called surrogate model that predicts the validation loss for a specific hyperparameter setting, model and dataset and then sequentially select the next hyperparameter to test, based on a heuristic function of the expected value and the uncertainty of the surrogate model called acquisition function (sequential model-based Bayesian optimization, SMBO). In this paper we model the hyperparameter optimization problem as a sequential decision problem, which hyperparameter to test next, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hadijomaa/HypRL
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Bandit Algorithms Research · Data Stream Mining Techniques