AMLB: an AutoML Benchmark
Pieter Gijsbers, Marcos L. P. Bueno, Stefan Coors, Erin LeDell,, S\'ebastien Poirier, Janek Thomas, Bernd Bischl, Joaquin Vanschoren

TL;DR
This paper introduces an open, extensible benchmark for comparing AutoML frameworks, providing a comprehensive evaluation across multiple tasks and metrics, with an open-source tool for automated empirical analysis.
Contribution
It presents a standardized, best-practice benchmark for AutoML frameworks, including an open-source tool for comprehensive, automated evaluation and analysis.
Findings
Significant performance differences among frameworks
Trade-offs between accuracy and inference time identified
Framework rankings vary across different task subsets
Abstract
Comparing different AutoML frameworks is notoriously challenging and often done incorrectly. We introduce an open and extensible benchmark that follows best practices and avoids common mistakes when comparing AutoML frameworks. We conduct a thorough comparison of 9 well-known AutoML frameworks across 71 classification and 33 regression tasks. The differences between the AutoML frameworks are explored with a multi-faceted analysis, evaluating model accuracy, its trade-offs with inference time, and framework failures. We also use Bradley-Terry trees to discover subsets of tasks where the relative AutoML framework rankings differ. The benchmark comes with an open-source tool that integrates with many AutoML frameworks and automates the empirical evaluation process end-to-end: from framework installation and resource allocation to in-depth evaluation. The benchmark uses public data sets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Data Stream Mining Techniques · Anomaly Detection Techniques and Applications
