An Analysis of an Alternative Pythagorean Expected Win Percentage Model: Applications Using Major League Baseball Team Quality Simulations
Justin Ehrlich, Christopher Boudreaux, James Boudreau, and Shane, Sanders

TL;DR
This study compares traditional and alternative models for estimating MLB team quality using extensive simulations, finding that the difference-form CSF model provides better explanatory power and less information loss than the Pythagorean model.
Contribution
It introduces a simulation-based approach to evaluate and compare expected win percentage models, highlighting the superiority of the difference-form CSF model over the traditional Pythagorean model.
Findings
Difference-form CSF outperforms Pythagorean model in explanatory power.
Simulation method yields realistic statistical outcomes.
Alternative CSF models improve win estimation accuracy.
Abstract
We ask if there are alternative contest models that minimize error or information loss from misspecification and outperform the Pythagorean model. This article aims to use simulated data to select the optimal expected win percentage model among the choice of relevant alternatives. The choices include the traditional Pythagorean model and the difference-form contest success function (CSF). Method. We simulate 1,000 iterations of the 2014 MLB season for the purpose of estimating and analyzing alternative models of expected win percentage (team quality). We use the open-source, Strategic Baseball Simulator and develop an AutoHotKey script that programmatically executes the SBS application, chooses the correct settings for the 2014 season, enters a unique ID for the simulation data file, and iterates these steps 1,000 times. We estimate expected win percentage using the traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance
