TL;DR
This paper introduces VolcanoML, a scalable AutoML framework that decomposes large search spaces into smaller ones, enabling more efficient exploration and outperforming existing systems like auto-sklearn.
Contribution
VolcanoML provides a systematic approach for search space decomposition and execution in AutoML, enhancing scalability and efficiency over prior methods.
Findings
Decomposition strategies significantly improve AutoML efficiency.
VolcanoML outperforms auto-sklearn in search space exploration.
The framework is extensible and supports complex AutoML pipelines.
Abstract
End-to-end AutoML has attracted intensive interests from both academia and industry which automatically searches for ML pipelines in a space induced by feature engineering, algorithm/model selection, and hyper-parameter tuning. Existing AutoML systems, however, suffer from scalability issues when applying to application domains with large, high-dimensional search spaces. We present VolcanoML, a scalable and extensible framework that facilitates systematic exploration of large AutoML search spaces. VolcanoML introduces and implements basic building blocks that decompose a large search space into smaller ones, and allows users to utilize these building blocks to compose an execution plan for the AutoML problem at hand. VolcanoML further supports a Volcano-style execution model -- akin to the one supported by modern database systems -- to execute the plan constructed. Our evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
