Modeling and Prediction of the UEFA EURO 2024 via Combined Statistical Learning Approaches
Andreas Groll, Lars M. Hvattum, Christophe Ley, Jonas Sternemann,, Gunther Schauberger, Achim Zeileis

TL;DR
This paper develops a combined machine learning model using linear, random forest, and gradient boosting techniques to forecast UEFA EURO 2024 outcomes, incorporating diverse data sources and simulating tournament results.
Contribution
It introduces a novel ensemble approach that integrates multiple models and diverse covariates for accurate football tournament predictions.
Findings
France identified as the top favorite with 19.2% winning probability.
The model effectively predicts match outcomes and tournament probabilities.
Simulation of 100,000 tournaments provides detailed probability distributions.
Abstract
In this work, three fundamentally different machine learning models are combined to create a new, joint model for forecasting the UEFA EURO 2024. Therefore, a generalized linear model, a random forest model, and a extreme gradient boosting model are used to predict the number of goals a team scores in a match. The three models are trained on the match results of the UEFA EUROs 2004-2020, with additional covariates characterizing the teams for each tournament as well as three enhanced variables derived from different ranking methods for football teams. The first enhanced variable is based on historic match data from national teams, the second is based on the bookmakers' tournament winning odds of all participating teams, and the third is based on historic match data of individual players both for club and international matches, resulting in player ratings. Then, based on current…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Sports Analytics and Performance · Statistical and Computational Modeling
