Relieving and Readjusting Pythagoras
Victor Luo, Steven J. Miller

TL;DR
This paper introduces a new, more accurate Pythagorean expectation formula for baseball winning percentage, based on a linear combination of Weibull distributions, improving prediction accuracy by about 25% with theoretical support.
Contribution
It proposes a novel linear combination of Weibull distributions to better estimate team run production and winning percentage, with theoretical justification for parameters.
Findings
Increases prediction accuracy by approximately 25%.
Performs as well or better than existing formulas.
Easily computed from publicly available data.
Abstract
Bill James invented the Pythagorean expectation in the late 70's to predict a baseball team's winning percentage knowing just their runs scored and allowed. His original formula estimates a winning percentage of , where stands for runs scored and for runs allowed; later versions found better agreement with data by replacing the exponent 2 with numbers near 1.83. Miller and his colleagues provided a theoretical justification by modeling runs scored and allowed by independent Weibull distributions. They showed that a single Weibull distribution did a very good job of describing runs scored and allowed, and led to a predicted won-loss percentage of , where and are the observed runs scored…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistics Education and Methodologies · Sports Dynamics and Biomechanics · Sports Analytics and Performance
