Quantifying Officiating Impact in the NBA: A Referee Impact Metric Analysis Using ESPN Win-Probability Data
Nirek Duma, Leo Benaharon

TL;DR
This paper introduces the Ref Impact Metric (RIM), a new game-level statistic that measures referee impact on NBA game outcomes by analyzing win-probability shifts caused by fouls, addressing limitations of traditional foul-based metrics.
Contribution
The paper develops and empirically validates RIM, a novel context-aware referee impact measure that captures the influence of officiating on game results beyond foul counts.
Findings
RIM is distinct from foul volume and disparity.
Certain referee and team patterns persist after controlling for context.
Patterns should be viewed as observational signals, not misconduct evidence.
Abstract
Over the past century, basketball analytics has moved from simple box-score rates toward complex context-aware measures that evaluate events by their expected effect on game outcomes. Officiating analysis has not made the same transition: existing work and public discussion still rely heavily on foul rates, foul differentials, reviewed late-game correctness labels, or team/player benefit from calls. This leaves an empirical gap because a low-leverage foul in a decided game should not be treated as equivalent to a whistle that materially shifts win probability in a close game. To address this gap, we introduce the Ref Impact Metric (RIM), a game-level statistic that aggregates the absolute win-probability movement attached to foul events, measuring the impact of each referee for each game. Using ESPN game-summary and win-probability data for NBA seasons 2021-2022 through 2024-2025, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
