TL;DR
This paper introduces coverage-based sampling designs, like Poisson disk sampling, to improve exploration in machine learning tasks such as sample mining and hyper-parameter optimization, demonstrating superior performance over traditional methods.
Contribution
It develops a parameterized family of coverage-based designs with provably better coverage and algorithms for effective sample synthesis tailored for ML applications.
Findings
Coverage-based designs outperform discrepancy-based samples in exploration.
The proposed methods improve hyper-parameter optimization efficiency.
Experiments show consistent gains in sample mining and Bayesian optimization.
Abstract
Sampling one or more effective solutions from large search spaces is a recurring idea in machine learning, and sequential optimization has become a popular solution. Typical examples include data summarization, sample mining for predictive modeling and hyper-parameter optimization. Existing solutions attempt to adaptively trade-off between global exploration and local exploitation, wherein the initial exploratory sample is critical to their success. While discrepancy-based samples have become the de facto approach for exploration, results from computer graphics suggest that coverage-based designs, e.g. Poisson disk sampling, can be a superior alternative. In order to successfully adopt coverage-based sample designs to ML applications, which were originally developed for 2-d image analysis, we propose fundamental advances by constructing a parameterized family of designs with provably…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
