Mining Robust Default Configurations for Resource-constrained AutoML
Moe Kayali, Chi Wang

TL;DR
This paper introduces a zero-shot AutoML configuration selection method that mines diverse tasks to identify robust default configurations, improving performance across multiple datasets without online training.
Contribution
The paper presents a novel offline mining approach for selecting default AutoML configurations that generalize well to unseen tasks, enhancing efficiency and effectiveness.
Findings
Outperforms state-of-the-art on 62 datasets
Effective for warm-starting existing AutoML platforms
Recommends data-dependent default configurations
Abstract
Automatic machine learning (AutoML) is a key enabler of the mass deployment of the next generation of machine learning systems. A key desideratum for future ML systems is the automatic selection of models and hyperparameters. We present a novel method of selecting performant configurations for a given task by performing offline autoML and mining over a diverse set of tasks. By mining the training tasks, we can select a compact portfolio of configurations that perform well over a wide variety of tasks, as well as learn a strategy to select portfolio configurations for yet-unseen tasks. The algorithm runs in a zero-shot manner, that is without training any models online except the chosen one. In a compute- or time-constrained setting, this virtually instant selection is highly performant. Further, we show that our approach is effective for warm-starting existing autoML platforms. In both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Software Engineering Research · Software Testing and Debugging Techniques
