TL;DR
This paper introduces an open, data-driven Skellam regression model to quantify individual player contributions in soccer, leveraging publicly available data to evaluate positional value and inform salary allocation.
Contribution
The paper develops a novel open framework using Skellam regression to estimate soccer players' contributions from publicly available data, addressing previous proprietary data limitations.
Findings
The model estimates the importance of different positional lines in winning games.
It translates contributions into expected league points above a replacement player (eLPAR).
Market undervalues defensive players relative to goalkeepers in the dataset.
Abstract
Soccer is undeniably the most popular sport world-wide and everyone from general managers and coaching staff to fans and media are interested in evaluating players' performance. Metrics applied successfully in other sports, such as the (adjusted) +/- that allows for division of credit among a basketball team's players, exhibit several challenges when applied to soccer due to severe co-linearities. Recently, a number of player evaluation metrics have been developed utilizing optical tracking data, but they are based on proprietary data. In this work, our objective is to develop an open framework that can estimate the expected contribution of a soccer player to his team's winning chances using publicly available data. In particular, using data from (i) approximately 20,000 games from 11 European leagues over 8 seasons, and, (ii) player ratings from the FIFA video game, we estimate through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
