Modeling All Response Surfaces in One for Conditional Search Spaces
Jiaxing Li, Wei Liu, Chao Xue, Yibing Zhan, Xiaoxing Wang, Weifeng, Liu, Dacheng Tao

TL;DR
This paper introduces a novel Bayesian Optimization method that models all response surfaces in a single framework using self-attention, effectively handling dependencies in conditional hyperparameter spaces for improved AutoML performance.
Contribution
It proposes a structure-aware hyperparameter embedding and an attention-based feature extractor to unify modeling of multiple subspaces with a single Gaussian Process, capturing inter-subspace relationships.
Findings
Improves BO efficiency in conditional search spaces.
Demonstrates superior performance on real-world tasks.
Outperforms existing methods on HPO-B benchmark.
Abstract
Bayesian Optimization (BO) is a sample-efficient black-box optimizer commonly used in search spaces where hyperparameters are independent. However, in many practical AutoML scenarios, there will be dependencies among hyperparameters, forming a conditional search space, which can be partitioned into structurally distinct subspaces. The structure and dimensionality of hyperparameter configurations vary across these subspaces, challenging the application of BO. Some previous BO works have proposed solutions to develop multiple Gaussian Process models in these subspaces. However, these approaches tend to be inefficient as they require a substantial number of observations to guarantee each GP's performance and cannot capture relationships between hyperparameters across different subspaces. To address these issues, this paper proposes a novel approach to model the response surfaces of all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTeaching and Learning Programming · Advanced Multi-Objective Optimization Algorithms · Optimization and Search Problems
MethodsGaussian Process
