What If They Took the Shot? A Hierarchical Bayesian Framework for Counterfactual Expected Goals
Mikayil Mahmudlu, Oktay Karaku\c{s}, Hasan Arkada\c{s}

TL;DR
This paper introduces a hierarchical Bayesian model for expected goals in soccer, incorporating expert knowledge to improve player-specific estimates and enable counterfactual analysis for tactical insights.
Contribution
It develops a novel Bayesian framework that combines expert priors with shot data to better estimate individual player finishing abilities and simulate hypothetical scenarios.
Findings
Hierarchical model reduces posterior uncertainty compared to weak priors.
Model achieves high correlation (R2=0.75) with baseline predictions and R2=0.833 with benchmark.
Identifies distinct player specialisation profiles and latent abilities.
Abstract
This study develops a hierarchical Bayesian framework that integrates expert domain knowledge to quantify player-specific effects in expected goals (xG) estimation, addressing a limitation of standard models that treat all players as identical finishers. Using 9,970 shots from StatsBomb's 2015-16 data and Football Manager 2017 ratings, we combine Bayesian logistic regression with informed priors to stabilise player-level estimates, especially for players with few shots. The hierarchical model reduces posterior uncertainty relative to weak priors and achieves strong external validity: hierarchical and baseline predictions correlate at R2 = 0.75, while an XGBoost benchmark validated against StatsBomb xG reaches R2 = 0.833. The model uncovers interpretable specialisation profiles, including one-on-one finishing (Aguero, Suarez, Belotti, Immobile, Martial), long-range shooting (Pogba), and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Sports Performance and Training · Sport Psychology and Performance
