AutoML to Date and Beyond: Challenges and Opportunities
Shubhra Kanti Karmaker Santu, Md. Mahadi Hassan, Micah J. Smith, Lei, Xu, ChengXiang Zhai, Kalyan Veeramachaneni

TL;DR
This paper reviews the current state of AutoML, introduces a new seven-level classification system based on autonomy, and discusses future challenges and research directions for fully automating the machine learning pipeline.
Contribution
It proposes a novel level-based taxonomy for AutoML systems, providing a structured framework to assess their autonomy and identify research gaps for full automation.
Findings
Existing AutoML systems automate some pipeline subtasks
Manual steps limit AutoML's accessibility for domain experts
Roadmap outlined for achieving full pipeline automation
Abstract
As big data becomes ubiquitous across domains, and more and more stakeholders aspire to make the most of their data, demand for machine learning tools has spurred researchers to explore the possibilities of automated machine learning (AutoML). AutoML tools aim to make machine learning accessible for non-machine learning experts (domain experts), to improve the efficiency of machine learning, and to accelerate machine learning research. But although automation and efficiency are among AutoML's main selling points, the process still requires human involvement at a number of vital steps, including understanding the attributes of domain-specific data, defining prediction problems, creating a suitable training data set, and selecting a promising machine learning technique. These steps often require a prolonged back-and-forth that makes this process inefficient for domain experts and data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Data Stream Mining Techniques
