Can AutoML outperform humans? An evaluation on popular OpenML datasets   using AutoML Benchmark

Marc Hanussek; Matthias Blohm; Maximilien Kintz

arXiv:2009.01564·cs.LG·December 16, 2020

Can AutoML outperform humans? An evaluation on popular OpenML datasets using AutoML Benchmark

Marc Hanussek, Matthias Blohm, Maximilien Kintz

PDF

TL;DR

This study evaluates whether AutoML frameworks can outperform human data scientists by comparing four AutoML tools on 12 popular datasets, finding that AutoML performs better or equally in over half of the tasks.

Contribution

The paper provides a comprehensive comparison of AutoML frameworks against human performance on diverse datasets, highlighting AutoML's competitive capabilities.

Findings

01

AutoML outperforms or matches human results in 7 of 12 datasets

02

AutoML performs well on both classification and regression tasks

03

AutoML shows promise for real-world applications

Abstract

In the last few years, Automated Machine Learning (AutoML) has gained much attention. With that said, the question arises whether AutoML can outperform results achieved by human data scientists. This paper compares four AutoML frameworks on 12 different popular datasets from OpenML; six of them supervised classification tasks and the other six supervised regression ones. Additionally, we consider a real-life dataset from one of our recent projects. The results show that the automated frameworks perform better or equal than the machine learning community in 7 out of 12 OpenML tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.