# next-gen-scraPy: Extracting NFL Tracking Data from Images to Evaluate   Quarterbacks and Pass Defenses

**Authors:** Sarah Mallepalle, Ron Yurko, Konstantinos Pelechrinis, Samuel L., Ventura

arXiv: 1906.03339 · 2019-12-09

## TL;DR

This paper presents a novel image processing tool to extract detailed NFL passing data from publicly available charts, enabling analysis of quarterback and defense performance comparable to proprietary tracking data.

## Contribution

The work introduces an image-based data extraction method and analytical models to evaluate NFL pass success spatially, bridging the gap between private and public data sources.

## Key findings

- Extracted pass data closely matches NFL tracking data
- Proposed CPAE metric aligns with NFL's proprietary measure
- Analyzed spatial tendencies of quarterbacks and defenses

## Abstract

The NFL collects detailed tracking data capturing the location of all players and the ball during each play. Although the raw form of this data is not publicly available, the NFL releases a set of aggregated statistics via their Next Gen Stats (NGS) platform. They also provide charts showing the locations of pass attempts and outcomes for individual quarterbacks. Our work aims to partially close the gap between what data is available privately (to NFL teams) and publicly, and our contribution is twofold. First, we introduce an image processing tool designed specifically for extracting the raw data from the NGS pass charts. We extract the pass outcome, coordinates, and other metadata. Second, we analyze the resulting dataset, examining the spatial tendencies and performances of individual quarterbacks and defenses. We use a generalized additive model for completion percentages by field location. We introduce a Naive Bayes approach for estimating the 2-D completion percentage surfaces of individual teams and quarterbacks, and we provide a one-number summary, completion percentage above expectation (CPAE), for evaluating quarterbacks and team defenses. We find that our pass location data closely matches the NFL's tracking data, and that our CPAE metric closely matches the NFL's proprietary CPAE metric.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.03339/full.md

## Figures

42 figures with captions in the complete paper: https://tomesphere.com/paper/1906.03339/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/1906.03339/full.md

---
Source: https://tomesphere.com/paper/1906.03339