OpenML Benchmarking Suites

Bernd Bischl; Giuseppe Casalicchio; Matthias Feurer; Pieter Gijsbers,; Frank Hutter; Michel Lang; Rafael G. Mantovani; Jan N. van Rijn; Joaquin; Vanschoren

arXiv:1708.03731·stat.ML·November 9, 2023·23 cites

OpenML Benchmarking Suites

Bernd Bischl, Giuseppe Casalicchio, Matthias Feurer, Pieter Gijsbers,, Frank Hutter, Michel Lang, Rafael G. Mantovani, Jan N. van Rijn, Joaquin, Vanschoren

PDF

Open Access 4 Repos

TL;DR

This paper advocates for standardized, comprehensive benchmarking suites in machine learning, introduces the OpenML platform for creating and sharing these benchmarks, and presents a curated classification suite for practical use.

Contribution

It introduces OpenML benchmarking suites, including a curated classification benchmark, enabling standardized, shareable, and reproducible machine learning evaluations.

Findings

01

OpenML suites are easy to use with standardized formats and APIs

02

The curated classification suite facilitates practical benchmarking

03

OpenML promotes sharing and reuse of benchmarking results

Abstract

Machine learning research depends on objectively interpretable, comparable, and reproducible algorithm benchmarks. We advocate the use of curated, comprehensive suites of machine learning tasks to standardize the setup, execution, and reporting of benchmarks. We enable this through software tools that help to create and leverage these benchmarking suites. These are seamlessly integrated into the OpenML platform, and accessible through interfaces in Python, Java, and R. OpenML benchmarking suites (a) are easy to use through standardized data formats, APIs, and client libraries; (b) come with extensive meta-information on the included datasets; and (c) allow benchmarks to be shared and reused in future studies. We then present a first, carefully curated and practical benchmarking suite for classification: the OpenML Curated Classification benchmarking suite 2018 (OpenML-CC18). Finally, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Explainable Artificial Intelligence (XAI)