On Taking Advantage of Opportunistic Meta-knowledge to Reduce Configuration Spaces for Automated Machine Learning
David Jacob Kedziora, Tien-Dung Nguyen, Katarzyna Musial, Bogdan, Gabrys

TL;DR
This paper explores leveraging historical meta-knowledge to efficiently reduce configuration spaces in AutoML, improving search efficiency and model accuracy by avoiding poor pipeline evaluations based on dataset difficulty.
Contribution
It introduces methods to utilize opportunistic and systematic meta-knowledge for AutoML space reduction, analyzing their impact and sensitivity to dataset difficulty.
Findings
Meta-knowledge can enhance AutoML outcomes when relevant
Optimal space culling balances conservativeness and radicalness
Dataset difficulty influences meta-knowledge utility
Abstract
The automated machine learning (AutoML) process can require searching through complex configuration spaces of not only machine learning (ML) components and their hyperparameters but also ways of composing them together, i.e. forming ML pipelines. Optimisation efficiency and the model accuracy attainable for a fixed time budget suffer if this pipeline configuration space is excessively large. A key research question is whether it is both possible and practical to preemptively avoid costly evaluations of poorly performing ML pipelines by leveraging their historical performance for various ML tasks, i.e. meta-knowledge. The previous experience comes in the form of classifier/regressor accuracy rankings derived from either (1) a substantial but non-exhaustive number of pipeline evaluations made during historical AutoML runs, i.e. 'opportunistic' meta-knowledge, or (2) comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Data Stream Mining Techniques · Explainable Artificial Intelligence (XAI)
