Learning Risk Preferences in Markov Decision Processes: an Application to the Fourth Down Decision in the National Football League

Nathan Sandholtz; Lucas Wu; Martin Puterman; and Timothy C. Y. Chan

arXiv:2309.00756·stat.AP·March 6, 2026

Learning Risk Preferences in Markov Decision Processes: an Application to the Fourth Down Decision in the National Football League

Nathan Sandholtz, Lucas Wu, Martin Puterman, and Timothy C. Y. Chan

PDF

Open Access 1 Repo

TL;DR

This paper models NFL coaches' fourth down decisions as risk-sensitive Markov decision processes, revealing their conservative risk preferences and how these vary by field position and over time.

Contribution

It introduces an inverse optimization framework to infer coaches' risk preferences from actual decision data in NFL games.

Findings

01

Coaches' decisions align with low-quantile risk preferences.

02

Risk tolerance increases in opponent's half.

03

League-wide risk preferences have become more aggressive over time.

Abstract

For decades, National Football League (NFL) coaches' observed fourth down decisions have been largely inconsistent with prescriptions based on statistical models. In this paper, we develop a framework to explain this discrepancy using an inverse optimization approach. We model the fourth down decision and the subsequent sequence of plays in a game as a Markov decision process (MDP), the dynamics of which we estimate from NFL play-by-play data from the 2014 through 2022 seasons. We assume that coaches' observed decisions are optimal but that the risk preferences governing their decisions are unknown. This yields an inverse decision problem for which the optimality criterion, or risk measure, of the MDP is the estimand. Using the quantile function to parameterize risk, we estimate which quantile-optimal policy yields the coaches' observed decisions as minimally suboptimal. In general, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nsandholtz/fourth_down_risk
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSports Analytics and Performance · Forecasting Techniques and Applications · Sports Performance and Training