Using Elo Rating as a Metric for Comparative Judgement in Educational Assessment
Andy Gray, Alma Rahat, Tom Crick, Stephen Lindsay, Darren Wallace

TL;DR
This paper explores using Elo ratings, a method from competitive gaming, as an efficient and reliable alternative to comparative judgement in educational assessment, validated through Twitter data on political humor.
Contribution
It demonstrates that Elo ratings closely match traditional comparative judgement rankings, offering a scalable and potentially less biased assessment method.
Findings
Elo ratings correlate highly with CJ rankings (Kendall's tau 0.96)
Elo method is statistically significantly similar to CJ
Potential for wider application in educational contexts
Abstract
Marking and feedback are essential features of teaching and learning, across the overwhelming majority of educational settings and contexts. However, it can take a great deal of time and effort for teachers to mark assessments, and to provide useful feedback to the students. Furthermore, it also creates a significant cognitive load on the assessors, especially in ensuring fairness and equity. Therefore, an alternative approach to marking called comparative judgement (CJ) has been proposed in the educational space. Inspired by the law of comparative judgment (LCJ). This pairwise comparison for as many pairs as possible can then be used to rank all submissions. Studies suggest that CJ is highly reliable and accurate while making it quick for the teachers. Alternative studies have questioned this claim suggesting that the process can increase bias in the results as the same submission is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStudent Assessment and Feedback · Software Engineering Research · Evaluation and Performance Assessment
