Improving Hyperparameter Optimization by Planning Ahead

Hadi S. Jomaa; Jonas Falkner; Lars Schmidt-Thieme

arXiv:2110.08028·cs.LG·October 18, 2021

Improving Hyperparameter Optimization by Planning Ahead

Hadi S. Jomaa, Jonas Falkner, Lars Schmidt-Thieme

PDF

Open Access

TL;DR

This paper introduces a planning-based transfer learning method for hyperparameter optimization that uses model predictive control with trajectory sampling, outperforming existing algorithms on multiple datasets.

Contribution

It proposes a novel transfer learning approach within model-based reinforcement learning, employing a look-ahead strategy for hyperparameter selection to improve optimization efficiency.

Findings

01

Outperforms state-of-the-art HPO algorithms on three meta-datasets.

02

Uses an ensemble of probabilistic models for trajectory sampling.

03

Employs a simple look-ahead policy for hyperparameter selection.

Abstract

Hyperparameter optimization (HPO) is generally treated as a bi-level optimization problem that involves fitting a (probabilistic) surrogate model to a set of observed hyperparameter responses, e.g. validation loss, and consequently maximizing an acquisition function using a surrogate model to identify good hyperparameter candidates for evaluation. The choice of a surrogate and/or acquisition function can be further improved via knowledge transfer across related tasks. In this paper, we propose a novel transfer learning approach, defined within the context of model-based reinforcement learning, where we represent the surrogate as an ensemble of probabilistic models that allows trajectory sampling. We further propose a new variant of model predictive control which employs a simple look-ahead strategy as a policy that optimizes a sequence of actions, representing hyperparameter candidates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms · Gaussian Processes and Bayesian Inference

MethodsHyper-parameter optimization