TL;DR
This paper introduces AVATAR, a surrogate model that predicts the validity of machine learning pipelines without execution, significantly speeding up pipeline optimization and improving results in automated machine learning systems.
Contribution
The paper presents a novel surrogate model, AVATAR, which evaluates ML pipeline validity efficiently, enabling faster and more effective pipeline optimization in AutoML.
Findings
AVATAR reduces time spent on invalid pipelines.
Using AVATAR improves the quality of optimized pipelines.
Integration with SMAC yields better solutions than standard methods.
Abstract
Automated machine learning pipeline (ML) composition and optimisation aim at automating the process of finding the most promising ML pipelines within allocated resources (i.e., time, CPU and memory). Existing methods, such as Bayesian-based and genetic-based optimisation, which are implemented in Auto-Weka, Auto-sklearn and TPOT, evaluate pipelines by executing them. Therefore, the pipeline composition and optimisation of these methods frequently require a tremendous amount of time that prevents them from exploring complex pipelines to find better predictive models. To further explore this research challenge, we have conducted experiments showing that many of the generated pipelines are invalid in the first place, and attempting to execute them is a waste of time and resources. To address this issue, we propose a novel method to evaluate the validity of ML pipelines, without their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
