Understanding and Pushing the Limits of the Elo Rating Algorithm
Leszek Szczecinski, Aymen Djebbi

TL;DR
This paper analyzes the mathematical foundations of the Elo rating system, clarifies its assumptions about draws, and introduces a flexible extension called κ-Elo that better adapts to different sports and draw frequencies.
Contribution
It explains the implicit assumptions of the Elo algorithm and proposes κ-Elo, a new rating method that adjusts for draw frequency, enhancing its applicability.
Findings
κ-Elo improves rating accuracy in sports with varying draw rates
The model clarifies Elo's assumptions about draws and extends its flexibility
Empirical results from football data demonstrate the effectiveness of κ-Elo
Abstract
This work is concerned with the rating of players/teams in face-to-face games with three possible outcomes: loss, win, and draw. This is one of the fundamental problems in sport analytics, where the very simple and popular, non-trivial algorithm was proposed by Arpad Elo in late fifties to rate chess players. In this work we explain the mathematical model underlying the Elo algorithm and, in particular, we explain what is the implicit but not yet spelled out, assumption about the model of draws. We further extend the model to provide flexibility and remove the unrealistic implicit assumptions of the Elo algorithm. This yields the new rating algorithm, we call -Elo, which is equally simple as the Elo algorithm but provides a possibility to adjust to the frequency of draws. The discussion of the importance of the appropriate choice of the parameters is carried out and illustrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Numerical Methods and Algorithms
