Judging the Judges: A General Framework for Evaluating the Performance of International Sports Judges
Sandro Heiniger, Hugues Mercier

TL;DR
This paper introduces a universal framework to evaluate the accuracy and bias of sports judges across multiple disciplines, revealing systemic issues and enabling fairer judging practices.
Contribution
It presents a broadly applicable method to assess judging accuracy and bias, identifying intrinsic judging errors and systemic problems in sports judging systems.
Findings
Judging errors follow a quadratic pattern related to athlete performance.
The framework can distinguish between unintentional errors and cheating.
Systemic judging issues are identified in dressage.
Abstract
The monitoring of judges and referees in sports has become an important topic due to the increasing media exposure of international sporting events and the large monetary sums involved. In this article, we present a method to assess the accuracy of sports judges and estimate their bias. Our method is broadly applicable to all sports where panels of judges evaluate athletic performances on a finite scale. We analyze judging scores from eight different sports with comparable judging systems: diving, dressage, figure skating, freestyle skiing (aerials), freestyle snowboard (halfpipe, slopestyle), gymnastics, ski jumping and synchronized swimming. With the notable exception of dressage, we identify, for each aforementioned sport, a general and accurate pattern of the intrinsic judging error as a function of the performance level of the athlete. This intrinsic judging inaccuracy is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Doping in Sports
