An Open Source AutoML Benchmark

Pieter Gijsbers; Erin LeDell; Janek Thomas; S\'ebastien Poirier; Bernd; Bischl; Joaquin Vanschoren

arXiv:1907.00909·cs.LG·July 2, 2019·48 cites

An Open Source AutoML Benchmark

Pieter Gijsbers, Erin LeDell, Janek Thomas, S\'ebastien Poirier, Bernd, Bischl, Joaquin Vanschoren

PDF

Open Access

TL;DR

This paper presents an open-source, extensible benchmark framework for AutoML systems, enabling accurate and standardized comparisons across multiple datasets, and demonstrates its use by evaluating four AutoML tools.

Contribution

It introduces a comprehensive, open-source AutoML benchmarking framework that follows best practices and facilitates fair comparisons.

Findings

01

Compared 4 AutoML systems across 39 datasets

02

Identified strengths and weaknesses of each system

03

Provided up-to-date results via a dedicated website

Abstract

In recent years, an active field of research has developed around automated machine learning (AutoML). Unfortunately, comparing different AutoML systems is hard and often done incorrectly. We introduce an open, ongoing, and extensible benchmark framework which follows best practices and avoids common mistakes. The framework is open-source, uses public datasets and has a website with up-to-date results. We use the framework to conduct a thorough comparison of 4 AutoML systems across 39 datasets and analyze the results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Data Stream Mining Techniques