Toward a Better Understanding of Leaderboard

Wenjie Zheng

arXiv:1510.03349 · stat.ML · June 8, 2017 · 2 cites

TL;DR

This paper analyzes the limitations of traditional leaderboards in machine learning competitions, proposes improvements to prevent overfitting, and discusses the theoretical aspects of leaderboard accuracy and complexity.

Contribution

It offers practical advice to improve leaderboard robustness, simplifies the Ladder leaderboard, and provides theoretical insights into sample complexity.

Findings

01. The Ladder leaderboard can be simplified by removing redundant computation.

02. The sample complexity of accurate leaderboard estimation is cubic in the inverse of the desired precision.

03. Two practical guidelines help prevent easy hacking and overfitting of leaderboards.
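
To make the cubic claim concrete, here is a small worked scaling, written under the assumption that the required number of holdout samples $n$ depends on the precision $\varepsilon$ only through $1/\varepsilon^{3}$. The constant $C$ is a placeholder, and any logarithmic factors are ignored because this summary does not state them.

```latex
n(\varepsilon) \approx \frac{C}{\varepsilon^{3}}
\quad\Longrightarrow\quad
\frac{n(\varepsilon/2)}{n(\varepsilon)}
  = \frac{C/(\varepsilon/2)^{3}}{C/\varepsilon^{3}}
  = 2^{3} = 8 .
```

Under this reading, halving the tolerated leaderboard error requires roughly eight times as many holdout samples.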

Abstract

The leaderboard in machine learning competitions is a tool for showing the performance of participants and comparing them. However, a leaderboard quickly loses accuracy because of hacking or overfitting. This article gives two pieces of advice to prevent easy hacking or overfitting. Following this advice leads to the conclusion that something like the Ladder leaderboard introduced in [blum2015ladder] is inevitable. With this understanding, we simplify Ladder by eliminating its redundant computation, and we explain how to choose its parameter and interpret it. We also prove that the sample complexity is cubic in the inverse of the desired precision of the leaderboard.
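
For readers unfamiliar with the Ladder mechanism referenced above, the following is a minimal sketch of the original Ladder update rule as it is commonly described for [blum2015ladder]: a submission only changes the public score when it beats the current best by more than a step size, and the published score is rounded to a multiple of that step size. This is not the simplified variant proposed in the paper, whose details are not given here. The class name `LadderLeaderboard` and the parameter name `step_size` are illustrative choices, not names from the paper.

```python
import math


class LadderLeaderboard:
    """Illustrative sketch of a Ladder-style leaderboard (lower loss is better)."""

    def __init__(self, step_size):
        # step_size is the minimum improvement that counts as real progress.
        self.step_size = step_size
        # No submission yet, so the best published loss starts at infinity.
        self.best = math.inf

    def submit(self, loss):
        """Take the holdout loss of a new submission and return the published score."""
        # Update only when the improvement exceeds the step size.
        if loss < self.best - self.step_size:
            # Publish the loss rounded to the nearest multiple of step_size.
            self.best = round(loss / self.step_size) * self.step_size
        # Otherwise the previously published score is released again.
        return self.best


board = LadderLeaderboard(step_size=0.01)
first = board.submit(0.30)    # first submission always sets the score
second = board.submit(0.295)  # improvement smaller than the step: score unchanged
third = board.submit(0.27)    # improvement larger than the step: score updated
```

Because small improvements are ignored, a participant who probes the holdout set with many near-identical submissions learns little from the published score, which is the intuition behind the mechanism's resistance to overfitting.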

