Integrating Unsupervised and Supervised Learning for the Prediction of Defensive Schemes in American football
Rouven Michels, Robert Bajons, Jan-Ole Fischer

TL;DR
This paper presents a novel statistical framework combining supervised and unsupervised learning to predict defensive schemes in American football using player tracking data, improving real-time recognition accuracy.
Contribution
It introduces an integrated approach using HMM-derived features with elastic net and gradient boosting models for better coverage scheme prediction.
Findings
HMM-based features improve predictive accuracy
Significant association between features and coverage outcomes
Provides interpretable insights into defensive adjustments
Abstract
Anticipating defensive coverage schemes is a crucial yet challenging task for offenses in American football. Because defenders' assignments are intentionally disguised before the snap, they remain difficult to recognize in real time. To address this challenge, we develop a statistical framework that integrates supervised and unsupervised learning using player tracking data. Our goal is to forecast the defensive coverage scheme -- man or zone -- through elastic net logistic regression and gradient-boosted decision trees with incrementally derived features. We first use features from the pre-motion situation, then incorporate players' trajectories during motion in a naive way, and finally include features derived from a hidden Markov model (HMM). Based on player movements, the non-homogeneous HMM infers latent defensive assignments between offensive and defensive players during motion and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Performance and Training · Sports Analytics and Performance · Time Series Analysis and Forecasting
