Hockey Player Performance via Regularized Logistic Regression
Robert B. Gramacy, Matt Taddy, and Sen Tian

TL;DR
This paper introduces a regularized logistic regression approach to accurately estimate individual hockey players' contributions by modeling game situations and controlling for confounding factors, overcoming limitations of traditional plus-minus metrics.
Contribution
It presents a novel high-dimensional logistic regression model with regularization to infer individual player effects in hockey, accounting for complex game situations and interactions.
Findings
Effective estimation of player contributions in high-dimensional settings
Regularized logistic regression improves interpretability and stability
Software tools facilitate practical application of the methodology
Abstract
A hockey player's plus-minus measures the difference between goals scored by and against that player's team while the player was on the ice. This measures only a marginal effect, failing to account for the influence of the others he is playing with and against. A better approach would be to jointly model the effects of all players, and any other confounding information, in order to infer a partial effect for this individual: his influence on the box score regardless of who else is on the ice. This chapter describes and illustrates a simple algorithm for recovering such partial effects. There are two main ingredients. First, we provide a logistic regression model that can predict which team has scored a given goal as a function of who was on the ice, what teams were playing, and details of the game situation (e.g. full-strength or power-play). Since the resulting model is so high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance
