Mining Robust Default Configurations for Resource-constrained AutoML

Moe Kayali; Chi Wang

arXiv:2202.09927·cs.LG·February 22, 2022·1 cites

Mining Robust Default Configurations for Resource-constrained AutoML

Moe Kayali, Chi Wang

PDF

Open Access

TL;DR

This paper introduces a zero-shot AutoML configuration selection method that mines diverse tasks to identify robust default configurations, improving performance across multiple datasets without online training.

Contribution

The paper presents a novel offline mining approach for selecting default AutoML configurations that generalize well to unseen tasks, enhancing efficiency and effectiveness.

Findings

01

Outperforms state-of-the-art on 62 datasets

02

Effective for warm-starting existing AutoML platforms

03

Recommends data-dependent default configurations

Abstract

Automatic machine learning (AutoML) is a key enabler of the mass deployment of the next generation of machine learning systems. A key desideratum for future ML systems is the automatic selection of models and hyperparameters. We present a novel method of selecting performant configurations for a given task by performing offline autoML and mining over a diverse set of tasks. By mining the training tasks, we can select a compact portfolio of configurations that perform well over a wide variety of tasks, as well as learn a strategy to select portfolio configurations for yet-unseen tasks. The algorithm runs in a zero-shot manner, that is without training any models online except the chosen one. In a compute- or time-constrained setting, this virtually instant selection is highly performant. Further, we show that our approach is effective for warm-starting existing autoML platforms. In both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Software Engineering Research · Software Testing and Debugging Techniques