Speeding Up Multi-Objective Hyperparameter Optimization by Task Similarity-Based Meta-Learning for the Tree-Structured Parzen Estimator
Shuhei Watanabe, Noor Awad, Masaki Onishi, Frank Hutter

TL;DR
This paper introduces a task similarity-based meta-learning approach to accelerate multi-objective hyperparameter optimization using TPE, significantly improving efficiency and achieving state-of-the-art results in deep learning benchmarks.
Contribution
It extends TPE's acquisition function with a novel task similarity measure, enabling effective meta-learning for multi-objective HPO, which was previously inapplicable.
Findings
Speeds up MO-TPE on tabular benchmarks
Achieves state-of-the-art multi-objective HPO performance
Wins AutoML 2022 competition on Transformers
Abstract
Hyperparameter optimization (HPO) is a vital step in improving performance in deep learning (DL). Practitioners are often faced with the trade-off between multiple criteria, such as accuracy and latency. Given the high computational needs of DL and the growing demand for efficient HPO, the acceleration of multi-objective (MO) optimization becomes ever more important. Despite the significant body of work on meta-learning for HPO, existing methods are inapplicable to MO tree-structured Parzen estimator (MO-TPE), a simple yet powerful MO-HPO algorithm. In this paper, we extend TPE's acquisition function to the meta-learning setting using a task similarity defined by the overlap of top domains between tasks. We also theoretically analyze and address the limitations of our task similarity. In the experiments, we demonstrate that our method speeds up MO-TPE on tabular HPO benchmarks and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications
MethodsHyper-parameter optimization
