Forecasting the Olympic medal distribution during a pandemic: a socio-economic machine learning model
Christoph Schlembach, Sascha L. Schmidt, Dominik Schreyer, Linus, Wunderlich

TL;DR
This paper introduces a socio-economic machine learning model using a two-staged Random Forest to forecast Olympic medal distributions, outperforming traditional methods and suggesting pandemic impacts are limited.
Contribution
The paper presents a novel two-staged Random Forest approach for Olympic medal forecasting, incorporating socio-economic factors and pandemic data, improving accuracy over naive models.
Findings
US predicted to lead with 120 medals
Pandemic impact on medal counts is limited
Model outperforms traditional forecasting methods
Abstract
Forecasting the number of Olympic medals for each nation is highly relevant for different stakeholders: Ex ante, sports betting companies can determine the odds while sponsors and media companies can allocate their resources to promising teams. Ex post, sports politicians and managers can benchmark the performance of their teams and evaluate the drivers of success. To significantly increase the Olympic medal forecasting accuracy, we apply machine learning, more specifically a two-staged Random Forest, thus outperforming more traditional na\"ive forecast for three previous Olympics held between 2008 and 2016 for the first time. Regarding the Tokyo 2020 Games in 2021, our model suggests that the United States will lead the Olympic medal table, winning 120 medals, followed by China (87) and Great Britain (74). Intriguingly, we predict that the current COVID-19 pandemic will not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Sport and Mega-Event Impacts
