Evaluating one-shot tournament predictions

Claus Thorn Ekstrøm, Hans Van Eetvelde, Christophe Ley, and Ulf Brefeld

arXiv:1912.07364 · stat.AP · December 17, 2019

TL;DR

This paper introduces the Tournament Rank Probability Score (TRPS), a flexible metric for evaluating and comparing pre-tournament predictions, and shows how predictions from historic tournaments can be optimally combined into an ensemble prediction for a new tournament.

Contribution

The paper presents the TRPS as a novel evaluation metric and proposes a method to combine historical tournament predictions into optimal ensembles.

Findings

01

TRPS evaluates predictions of partial as well as full rankings of teams and gives credit to predictions that are only slightly wrong.

02

A weighted TRPS can stress the importance of particular features of the tournament prediction.

03

Predictions from historic tournaments can be combined into ensemble predictions that are optimal with respect to the TRPS for a new tournament.

Abstract

We introduce the Tournament Rank Probability Score (TRPS) as a measure to evaluate and compare pre-tournament predictions, where predictions of the full tournament results are required to be available before the tournament begins. The TRPS handles partial ranking of teams, gives credit to predictions that are only slightly wrong, and can be modified with weights to stress the importance of particular features of the tournament prediction. Thus, the TRPS is more flexible than the commonly preferred log loss score for such tasks. In addition, we show how predictions from historic tournaments can be optimally combined into ensemble predictions in order to maximize the TRPS for a new tournament.
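The abstract does not state the TRPS formula, so the following is only a minimal sketch of how a score with the described properties might be computed. It assumes the usual rank probability score construction: squared differences between cumulative predicted and cumulative observed rank distributions, averaged over teams and rank categories. The function name `rank_probability_score`, the matrix layout, and the sign convention (lower values indicate better predictions, 0 is a perfect prediction) are assumptions made for illustration and are not taken from the paper.

```python
def rank_probability_score(pred, outcome):
    """Hypothetical rank-probability-style score (an assumed form, not the paper's definition).

    pred[r][t]    : predicted probability that team t finishes in rank category r
    outcome[r][t] : 1 if team t actually finished in rank category r, else 0
    Rank categories may group several places (e.g. "3rd-4th"), which is how
    partial rankings are represented in this sketch.
    """
    n_ranks = len(pred)
    n_teams = len(pred[0])
    total = 0.0
    for t in range(n_teams):
        cum_pred = 0.0
        cum_obs = 0.0
        # The last category is skipped: both cumulative sums equal 1 there.
        for r in range(n_ranks - 1):
            cum_pred += pred[r][t]
            cum_obs += outcome[r][t]
            total += (cum_pred - cum_obs) ** 2
    return total / (n_teams * (n_ranks - 1))


# Four teams, three rank categories: 1st, 2nd, and a tied 3rd-4th category.
outcome = [
    [1, 0, 0, 0],  # team 0 won
    [0, 1, 0, 0],  # team 1 was runner-up
    [0, 0, 1, 1],  # teams 2 and 3 shared the last category
]

# A prediction that swaps winner and runner-up (slightly wrong).
near_miss = [
    [0, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 1, 1],
]

# A prediction that swaps the winner with a last-category team (more wrong).
far_miss = [
    [0, 0, 1, 0],
    [0, 1, 0, 0],
    [1, 0, 0, 1],
]

score_perfect = rank_probability_score(outcome, outcome)
score_near = rank_probability_score(near_miss, outcome)
score_far = rank_probability_score(far_miss, outcome)
```

Because the score compares cumulative distributions, the near miss is penalized less than the far miss, which illustrates the "credit to predictions that are only slightly wrong" property. Under the sign convention assumed here, the best prediction minimizes the returned value.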
