AutoWeka4MCPS-AVATAR: Accelerating Automated Machine Learning Pipeline Composition and Optimisation

Tien-Dung Nguyen; Bogdan Gabrys; Katarzyna Musial

arXiv:2011.11846·cs.LG·November 25, 2020

TL;DR

This paper introduces AVATAR, a surrogate model that predicts whether a machine learning pipeline is valid without executing it. Rejecting invalid pipelines early speeds up pipeline optimisation and improves the results of automated machine learning systems.

Contribution

The paper presents a novel surrogate model, AVATAR, which evaluates ML pipeline validity efficiently, enabling faster and more effective pipeline optimization in AutoML.

Findings

01. AVATAR reduces the time spent on invalid pipelines.

02. Using AVATAR improves the quality of the optimised pipelines.

03. Integrating AVATAR with SMAC yields better solutions than SMAC on its own.

Abstract

Automated machine learning (ML) pipeline composition and optimisation aim to automate the process of finding the most promising ML pipelines within allocated resources (i.e., time, CPU and memory). Existing methods, such as Bayesian-based and genetic-based optimisation, which are implemented in Auto-Weka, Auto-sklearn and TPOT, evaluate pipelines by executing them. As a result, pipeline composition and optimisation with these methods frequently require a tremendous amount of time, which prevents them from exploring complex pipelines to find better predictive models. To further explore this research challenge, we have conducted experiments showing that many of the generated pipelines are invalid in the first place, and attempting to execute them is a waste of time and resources. To address this issue, we propose a novel method to evaluate the validity of ML pipelines, without their…
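The idea of rejecting an invalid pipeline before running it can be illustrated with a toy check. The sketch below is not AVATAR's actual surrogate model, which this page does not describe in detail. It assumes, purely for illustration, that each pipeline component can be summarised by the dataset properties it cannot handle and the properties it removes or adds. The names `Component`, `is_valid`, `imputer` and `classifier`, and the property labels, are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Component:
    """Hypothetical summary of one pipeline step (an assumption, not AVATAR's format)."""
    name: str
    forbids: frozenset = frozenset()   # dataset properties this step cannot handle
    removes: frozenset = frozenset()   # properties absent from this step's output
    adds: frozenset = frozenset()      # properties introduced by this step


def is_valid(pipeline, data_props):
    """Propagate dataset properties through the steps without executing them."""
    props = set(data_props)
    for comp in pipeline:
        if props & comp.forbids:
            # The step would fail on this input, so reject the pipeline early.
            return False
        props = (props - comp.removes) | comp.adds
    return True


# Illustrative components: an imputer that removes missing values,
# and a classifier that cannot accept them.
imputer = Component("imputer", removes=frozenset({"missing_values"}))
classifier = Component("classifier", forbids=frozenset({"missing_values"}))

data = {"missing_values", "numeric_attributes"}
print(is_valid([classifier], data))           # False
print(is_valid([imputer, classifier], data))  # True
```

A check of this kind only touches small sets of labels, so an optimiser could in principle screen many candidate pipelines for the cost of executing one. How well such a check works depends on how accurately the component summaries reflect real behaviour.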

Code & Models

Repositories

UTS-AAi/autoweka (official)
