High-dimensional Bayesian Optimization with Group Testing
Erik Orm Hellsten, Carl Hvarfner, Leonard Papenmeier, Luigi Nardi

TL;DR
This paper introduces GTBO, a novel Bayesian optimization method that uses group testing to identify active variables in high-dimensional black-box functions, improving efficiency and interpretability.
Contribution
The paper proposes a new group testing-based algorithm for high-dimensional Bayesian optimization that identifies active variables to enhance optimization efficiency.
Findings
GTBO effectively identifies active variables in high-dimensional spaces.
GTBO outperforms existing methods on synthetic and real-world tasks.
GTBO helps practitioners understand which parameters influence the objective.
Abstract
Bayesian optimization is an effective method for optimizing expensive-to-evaluate black-box functions. High-dimensional problems are particularly challenging as the surrogate model of the objective suffers from the curse of dimensionality, which makes accurate modeling difficult. We propose a group testing approach to identify active variables to facilitate efficient optimization in these domains. The proposed algorithm, Group Testing Bayesian Optimization (GTBO), first runs a testing phase where groups of variables are systematically selected and tested on whether they influence the objective. To that end, we extend the well-established theory of group testing to functions of continuous ranges. In the second phase, GTBO guides optimization by placing more importance on the active dimensions. By exploiting the axis-aligned subspace assumption, GTBO is competitive against…
Peer Reviews
Decision·Submitted to ICLR 2024
1. The paper is mostly well written. 2. The baselines and relevant literature is well covered and duly introduced to the readers. 3. The results from the experiments suggest that the nmethod works well as compared to baselines both on simulated and real world datasets. 4. I think it is a strenght of the method that it combines the advantages of interpretibility which come along with feature selection compared to projection based approaches so the user gets to understand his data as he is perform
1. The model makes many assumptions that the features are relatively independent, since for highly correlated features, it might not be possible to break them into sets of active and inactive features without knowing their correlations beforehand. Assuming that the probabilities of dimensions to be active are independent, is rather a strong simplifying assumption and will not hold in many practical datasets and situations. 2. The paper does not list its own limitations properly. 3. Certain cho
1. The presentation of the group testing methodology clearly explains the mathematical foundation and practical implementation details of the proposed algorithm. 1. The computational tests are evaluated on several synthetic problems and also real-world benchmarks.
1. While this work is presented as a new BO algorithm, it is effectively a feature selection algorithm. The proposed group testing algorithm could select active dimensions to be optimized by any standard BO algorithm (e.g., ignoring the inactive dimensions). Likewise, a different feature selection method could be followed by the employed BO, which is relatively standard. 1. The motivation for the benchmark problems needs to be strengthened. The synthetic benchmarks have 2-8 active dimensions and
1. The proposed group testing idea is clear and easy to follow. 2. The experiment shows that using group testing can efficiently identify active variables.
1. I think the experiment comparison is not fair, where the best initial point of GTBO is always better than baselines, which gives additional advantage to GTBO. 2. I think the work of MCTS-VS[1] is also a HDBO method using variable selection, which is a similar work as GTBO and should be added into the baselines. [1] Song, Lei, et al. "Monte carlo tree search based variable selection for high dimensional bayesian optimization." Advances in Neural Information Processing Systems 35 (2022): 284
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSARS-CoV-2 detection and testing · Machine Learning and Algorithms · Advanced biosensing and bioanalysis techniques
