Learning, Visualizing, and Exploiting a Model for the Intrinsic Value of a Batted Ball
Glenn Healey

TL;DR
This paper introduces a Bayesian-based algorithm to quantify the intrinsic value of batted balls in baseball, separating contact quality from external factors, and providing new metrics for player evaluation and environmental impact analysis.
Contribution
The work presents a novel nonparametric Bayesian model that estimates the intrinsic value of batted balls, improving player and defense evaluation methods.
Findings
Developed a kernel-based Bayesian model for batted ball value
Derived new statistics for intrinsic contact quality of players
Provided insights into environmental effects on batted ball outcomes
Abstract
We present an algorithm for learning the intrinsic value of a batted ball in baseball. This work addresses the fundamental problem of separating the value of a batted ball at contact from factors such as the defense, weather, and ballpark that can affect its observed outcome. The algorithm uses a Bayesian model to construct a continuous mapping from a vector of batted ball parameters to an intrinsic measure defined as the expected value of a linear weights representation for run value. A kernel method is used to build nonparametric estimates for the component probability density functions in Bayes theorem from a set of over one hundred thousand batted ball measurements recorded by the HITf/x system during the 2014 major league baseball (MLB) season. Cross-validation is used to determine the optimal vector of smoothing parameters for the density estimates. Properties of the mapping are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Data Analysis with R · Time Series Analysis and Forecasting
