Partial membership models for soft clustering of multivariate football player performance data
Emiliano Seri, Roberto Rocci, Thomas Brendan Murphy

TL;DR
This paper introduces a partial membership model for soft clustering of football player performance data, allowing players to belong to multiple positions simultaneously, which improves understanding of their playing styles.
Contribution
It applies Bayesian partial membership models to football data, comparing with other mixture models, and demonstrates practical insights for football strategy and talent assessment.
Findings
Partial membership models effectively identify players' multi-position tendencies.
Compared with other models, partial membership provides more nuanced player position profiles.
Application results can inform coaching and talent scouting decisions.
Abstract
The standard mixture modeling framework has been widely used to study heterogeneous populations, by modeling them as being composed of a finite number of homogeneous sub-populations. However, the standard mixture model assumes that each data point belongs to one and only one mixture component, or cluster, but when data points have fractional membership in multiple clusters this assumption is unrealistic. It is in fact conceptually very different to represent an observation as partly belonging to multiple groups instead of belonging to one group with uncertainty. For this purpose, various soft clustering approaches, or individual-level mixture models, have been developed. In this context, Heller et al (2008) formulated the Bayesian partial membership model (PM) as an alternative structure for individual-level mixtures, which also captures partial membership in the form of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Time Series Analysis and Forecasting · Data Mining Algorithms and Applications
