Biases in Expected Goals Models Confound Finishing Ability
Jesse Davis, Pieter Robberechts

TL;DR
This paper critically examines biases in Expected Goals models in soccer, revealing that current metrics are confounded by data biases and proposing a subgroup-calibrated model to better assess players' finishing skills, exemplified by Messi.
Contribution
The study identifies biases in xG models affecting skill measurement and introduces a subgroup-calibrated approach to improve accuracy in evaluating finishing ability.
Findings
Overperformance requires high shot volume and exceptional finishing.
Including all shot types can obscure true finishing skill.
Biases cause standard xG models to underestimate Messi's finishing ability.
Abstract
Expected Goals (xG) has emerged as a popular tool for evaluating finishing skill in soccer analytics. It involves comparing a player's cumulative xG with their actual goal output, where consistent overperformance indicates strong finishing ability. However, the assessment of finishing skill in soccer using xG remains contentious due to players' difficulty in consistently outperforming their cumulative xG. In this paper, we aim to address the limitations and nuances surrounding the evaluation of finishing skill using xG statistics. Specifically, we explore three hypotheses: (1) the deviation between actual and expected goals is an inadequate metric due to the high variance of shot outcomes and limited sample sizes, (2) the inclusion of all shots in cumulative xG calculation may be inappropriate, and (3) xG models contain biases arising from interdependencies in the data that affect skill…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Sports Performance and Training · Sport Psychology and Performance
