AI based tool for monitoring intensity and fatigue in elite women handball
Florian Felice

TL;DR
This paper introduces an AI tool that helps track and predict performance and fatigue in elite women handball players using wearable data.
Contribution
The novel contribution is an AI tool combining predictive accuracy and explainability for monitoring athlete intensity and fatigue.
Findings
The AI tool accurately predicts key performance indicators like running distance and speed.
The model provides actionable insights for optimizing training and lineup strategies.
The approach can be extended to predict other physiological KPIs in handball.
Abstract
We propose an AI-based tool to predict and monitor Key Performance Indicators (KPIs) for player’s activity such as running distance and speed from wearable devices. These KPIs serve as proxies for intensity and fatigue levels in professional athletes. Applied to a women’s professional handball team competing at the EHF Champions League level, our model helps predict player workload and physiological stress, enabling accurate monitoring of player condition. By combining predictive accuracy with explainability methods, our tool not only forecasts fatigue and intensity metrics but also provides actionable insights for coaching staff to optimize training and lineup strategies. This work demonstrates the potential of advanced machine learning methods and can be extended to the prediction of any physiological KPI to support handball performance monitoring.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2| Position | Distance | Max speed | Metabolic power | Acceleration load accumulated | Jumps | Changes of direction |
|---|---|---|---|---|---|---|
| (m) | (km/h) | (W) | ||||
| Wing | 5294.5 | 26.1 | 5560.8 | 604.9 | 17.6 | 55.9 |
| Back | 4392.9 | 24.7 | 8173.6 | 540.2 | 14.2 | 35.2 |
| Pivot | 3964.4 | 22.4 | 4388.9 | 532.1 | 14.5 | 22.9 |
|
|
|
|
|
|
|
|
| Model/metric | Distance total | Metabolic power max | Acceleration load accumulated | Speed max |
|---|---|---|---|---|
| Moving average | 1480.33 | 1116.03 | 164.74 | 4.28 |
| ARIMA | 1515.09 | 1267.72 | 168.56 | 4.03 |
| Exponential smoothing | 1514.36 | 1273.93 | 175.95 | 4.03 |
| LSTM (proposed) |
|
|
|
|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Performance and Training · Sports Analytics and Performance · Explainable Artificial Intelligence (XAI)
Introduction
1
Motivation
1.1
Over the last few years, elite team sports have increasingly embraced data-driven approaches to optimize player performance, reduce injury risk, and enhance tactical decision-making. Today, the sports analytics literature on wearable sensors data is rich (1). Several sports such as basketball, american football, soccer or rugby have fully adopted these technologies (2–5).
As the technology becomes more affordable and the need to include advanced analytics to optimize elite performances increases, handball teams and federations started to include video analytics and wearable sensors as part of the preparation for competitions. The European Handball Federation (EHF) has been at the forefront of this approach by including wearable sensors analysis during the recent european championships competitions (6). However, despite the growing adoption of wearable sensors in team sports, research on predictive models for monitoring performance KPIs from sensor data remains limited, particularly in handball where the literature is scarce beyond descriptive analytics (6). While machine learning has been successfully applied to injury prediction and match outcome forecasting in various sports (1), there is a gap in developing predictive tools that combine forecasting capabilities with explainability to make wearable sensor data actionable for coaches. Our work addresses this gap by proposing an AI-based approach that not only predicts key performance indicators but also provides explainable and actionable insights to support data-driven decision-making in handball.
Elite European teams such as Metz Handball (France) strive to win the world’s most prestigious competition, the EHF Champions League. To reach this goal, the team and staff constantly try to reinvent the way they work and include new tools and technologies in their daily work to push the boundaries of performance. In this context, we collaborate with the team to support them in designing tools from data they extract from wearable sensors. These tools aim to help the team in their quest for additional trophies and titles.
Wearable sensors
1.2
Our data are extracted from historical sensor data worn by players from the start of the 2024/2025 season. The gyroscopic sensors are worn on the upper back by players during team trainings and matches. The sensors collect information about movements in 3D space and record directions, speed and intensity of movements and jumps. The collected data are then summarized by session (one match or one team training) and constitute a series as more sessions get added.
Our data set contains a total of 1,617 sessions (108 sessions per player on average) over the 2024/2025 season. One session corresponds to either a match or a team training. The key KPIs of interest for our forecast are the total distance run during the session, the maximum speed, maximal metabolic power and the accumulated acceleration load.
The sensors are worn by the wing, back and pivot players during the sessions. After each session, the staff retrieves the data through the Kinexon API (sensor provider), which automatically formats and summarizes the metrics per session and player for our analysis and modeling.
Paper overview
1.3
Working directly with Metz Handball’s staff, we developed a practical machine learning approach for predicting player activity metrics from wearable sensor data. Our framework combines predictive modeling with explainable AI techniques to create actionable insights for coaching decisions. The solution is deployed and leveraged for matches including the local French championship and the EHF Champions League.
In this document, we provide an evaluation of multiple modeling approaches for forecasting a set of performance indicators such as running distance, maximum speed, and metabolic power. Beyond predictive accuracy, we integrate explainability methods to highlight the most influential patterns on future predictions. Such explanations allow coaches to understand both what the model predicts and why.
The rest of the document is organized as follows. Section 2 describes the methodology including data processing, baseline models, the proposed architecture and the explainability techniques used. Section 3 presents the predictive performance and insights from the explainability layer. Section 4 discusses with prior handball-match prediction work and outlines future directions. Finally, Section 5 concludes the paper.
Methodology
2
In this section, we describe the methodology used in our study, including data collection, preprocessing, and modeling techniques.
Data analysis & preparation
2.1
Our primary data analysis, summarized in Table 1, highlights expected performance patterns across positions. These reflect the physiological demand and specific roles for each players in handball.
We first note that the positions displayed are for Wing players (left and right combined), Back players (left, center and right combined) and Pivot (line player). Goalkeepers are not included in this analysis as no data is available since they do not wear sensors. Indeed, given their specific role, the staff considered (after several experiments) that the data collected from goalkeepers were not relevant and not adequate to monitor any form of performance.
The first pattern is highlighted by the larger total distance run by wing players (5294.5 m) as well as the higher average maximum speed compared to other positions (26.1 km/h). This reflects the specific position on the court which requires longer distances to run as well as their strategic role in fast breaks. The high total distance and maximum speed also illustrate one of the key differentiators for Metz Handball across the European stage with high intensity and multiple fast breaks, primarily driven by their wingers.
Back players show an important difference on the maximum metabolic power measured during a session (8173.6 W). This translates the specific activity of the central players in defense as well as in attack. Indeed, back players are often having lots of contacts with high intensity on both sides of the court. Additionally, the average maximum speed (24.7 km/h) and total distance (4392.9 m) also suggest that back players also play a key role in turnover situations, especially with dynamic center back players such as Petra Vamos and Léna Grandveau.
Finally, we can observe that pivots (line players) show less changes of directions or exertions than wing or back players. This may suggest that the role is more “static.” However, their central role in defense and the positioning in attack suggest less distance to run compared to other players and the positioning may be, from a sensor perspective, considered as less intense than other roles.
Some of these indicators are also recorded and analyzed on the international stage. The analysis of the 2024 European Championships highlighted the swiss player Mia Emmenegger recorded an average maximum speed of 28.28 km/h, the highest from the competition. As a comparison, the average maximum speed across the whole season for Metz is 28.03 km/h for Chloé Valentini, in similar high standards. Additionally, the maximum top speed in that season was recorded at 30.24 km/h, also by wing player Chloé Valentini.
Baseline models
2.2
A first approach to model player’s workload is to leverage classical time series techniques and predict each KPI individually. We establish performance benchmarks using three classical time series forecasting methods: an simple moving average, an autoregressive moving average (ARIMA) model, and an exponential smoothing model. The Moving Average predicts the next time stamps as the mean of the last observations, providing a simple trend-following baseline. The ARIMA (AutoRegressive Integrated Moving Average) model extends the moving average with an autoregressive step to capture linear dependencies and seasonal patterns. Finally, Exponential Smoothing applies exponentially decreasing weights to historical observations, putting more emphasis on recent data.
These univariate approaches serve as reference models for comparing our proposed approach.
Proposed architecture
2.3
Our proposed approach depicted in Figure 1 leverages a recurrent neural network with LSTM cells (7) as a multi-target approach to predict multiple series. Each predicted series corresponds to a specific KPI for an individual player. We thus have as many targets to predict as input series. We also add another series describing the type of session (match or training) as an exogenous input. This series will not be predicted by the model and will help capture specific patterns. Finally, we include an embedding layer to learn the individual representation of the players (position, individual abilities, etc.). The input series and the embedding layer are then concatenated and feed a fully connected network.
Model architecture with LSTM cells.
We note that our approach allows to include static information (such as player information) with a dense input layer that can be merged with the other inputs in a similar manner. Additionally, the hyper-parameters of our network (such as number of LSTM cells, width and depth of the fully connected network, etc) are optimized using some hyper-parameter optimization (8).
Our approach also includes drop out for the LSTM cells and dense layers. This approach is used to add regularization to the model in the training step but is also used to measure the uncertainty during the predictions. Indeed, we leverage the Monte-Carlo Dropout (MC Dropout) methodology (9) to measure the model’s uncertainty and derive confidence intervals.
Explainability layer
2.4
To provide the staff with actionable insights from model predictions, we implement gradient-based feature importance analysis. This technique computes the sensitivity of predictions to input features by calculating gradients with respect to each biomechanical variable. The resulting importance scores reveal which player metrics most influence the performance forecasts for each KPI.
We leverage a large language model (LLM) using the Ph4-mini reasoning model (10) to transform these technical importance scores into natural language explanations. The LLM generates contextual insights that the staff can directly interpret and act upon. For instance, when the model predicts decreased power output, the system might explain that elevated fatigue indicators in recent time steps are the primary contributing factors, suggesting specific intervention strategies.
Results
3
Predictive performance
3.1
We evaluate our proposed LSTM-based approach against the presented baseline models to assess predictive accuracy across our target KPIs. We predict the next 10 sessions for each player on our test dataset. We present the Root Mean Square Error (RMSE) results across each KPI in Table 2. Our LSTM model consistently outperforms all baseline models across every measured KPI. The greatest accuracy gain is observed for the Distance Total metric, where the LSTM achieves an RMSE of 1273.45 which is 14.0% lower than the best baseline performance.
We note that the greatest gain is observed for Distance Total which can be explained by the link with the type of session. Indeed, the type of session (match or training) has a direct impact on the total distance run by players. Therefore, our LSTM model is able to capture these patterns by taking these exogenous inputs into account and provide more accurate predictions. Additionally, the multi-target approach allows the model to leverage non-explicit correlations between the different KPIs to improve accuracy.
We illustrate in Figure 2 the model’s prediction quality for a randomly selected player, specifically showing acceleration load accumulation forecasts over a 10-step prediction horizon. We can observe that all actual observations fall within the 95% confidence intervals generated through MC Dropout. The confidence level is set by default to 95% and will be adjusted based on empirical feedback from the coaching staff throughout the deployment of the solution.
LSTM prediction accuracy for acceleration load accumulation over 10 time steps on the test set. The model achieves RMSE = 107.30 and MAPE = 18.5%. We observe that the actual observations are all within the 95% confidence intervals.
These confidence intervals will provide valuable information for coaches, as they understand the uncertainty associated with each prediction.
Explainability insights & real-world deployment
3.2
The gradient-based feature importance analysis identifies recent training intensity and player position as key drivers for distance predictions, while metabolic power forecasts rely heavily on historical outputs and recovery periods. The Large Language Model (Ph4-mini with reasoning) translates these technical scores into actionable coaching insights, generating natural language explanations highlighting potential player fatigue risks.
The MC Dropout uncertainty quantification allows us to adjust confidence levels in predictions, while the explainability layer provides the coach with insights, essential for adoption in professional sports environments.
Discussion
4
Comparison with prior handball-match work
4.1
Our work follows a natural continuation from (10). Outside of the technical change from tabular to sequential data, the focus of this work oriented towards workload and physical activity monitoring. The direction of the work still starts backwards from athletes and coaches needs of understanding player’s activity to monitor workload and optimize the team’s performance. We include explainable AI methods as a critical layer to extract model’s learnt patterns and feed into a large language model to translate technical conclusions into sports insights and actions.
While the intent from match prediction and workload monitoring is different, both works have, in their roots, still the same objective: anticipate trends and optimize the team’s performance. Therefore, considering the implications from Section 3.2, the results from this work and match predictions can be used in conjunction and seen as complementary. Match predictions, along with their AI-generated explanations, can be considered as the anticipation of match patterns based on matches history, workload predictions can be used to understand physical constraints. While the predictions explanations will provide trends where some players can be expected to have an important impact on the match, the coach can plan for adapted strategies given the predicted ability for players to put physical intensity.
Future work and potential extensions
4.2
This work is done leveraging a specific wearable sensors technology for movement patterns. The developped framework is not restricted to any type of data and can be extended to other sources. In particular, discussions with the handball professionals also suggested the integration of cardiological data to monitor heartbeat and physical intensity. Therefore, our work can be replicated for other sports, sensors technology and even for recreational purposes.
Professional athletes and their staff also monitor health via survey tools. This allows coaches to collect feedback on players’ potential fatigue, injuries or health problems. Furthermore, the menstrual cycle of women’s athletes is also monitored to adapt the physical expectations of players. The addition of such data could constitute a complete set to provide insightful information to the model, and thus to the coach, on how to prepare matches and adapt players workloads.
Conclusion
5
We presented an AI-based tool for monitoring player workload metrics from wearable sensor data. The prediction performance of the LSTM step achieved superior performance over classical time series methods across all evaluated KPIs. The integration of explainable AI through gradient-based feature importance and natural language generation enables coaches to understand and act upon model predictions without technical expertise.
The successful deployment at Metz Handball during the 2025/2026 season will demonstrate the practical value of combining predictive accuracy with interpretability in professional sports environments. Beyond its application to handball, our work constitutes a generalizable framework for predictive modeling with explainability that extends to any domain involving time series forecasting for individuals where interpretable insights are essential for decision-making. While we demonstrate the approach with wearable sensors in professional handball, providing coaches with actionable information to monitor player performance, the methodology can be readily adapted to other sports, health monitoring applications, or any field requiring personalized time series predictions with transparent explanations. The tool’s modular design allows extension to other sports, sensor technologies, and physiological data sources, supporting broader applications in sports, ranging from young and recreational athletes to elite performance monitoring injury and optimizing for performance.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Rana M Mittal V. Wearable sensors for real-time kinematics analysis in sports: a review. IEEE Sens J. (2021) 21:1187–207. 10.1109/JSEN.2020.3019016 · doi ↗
- 2Hollaus B Stabinger S Mehrle A Raschner C. Using wearable sensors and a convolutional neural network for catch detection in American football. Sensors. (2020) 20:6722. 10.3390/s 2023672233255462 PMC 7727841 · doi ↗ · pubmed ↗
- 3Kelly D Coughlan GF Green BS Caulfield B. Automatic detection of collisions in elite level rugby union using a wearable sensing device. Sports Eng. (2012) 15:81–92. 10.1007/s 12283-012-0088-5 · doi ↗
- 4Kuhlman N Min C-H. Analysis and classification of basketball shooting form using wearable sensor systems. In: 2021 IEEE 11th Annual Computing and Communication Workshop and Conference (CCWC). (2021). p. 1478–82.
- 5Piłka T Grzelak B Sadurska A Górecki T Dyczkowski K. Predicting injuries in football based on data collected from GPS-based wearable sensors. Sensors. (2023) 23:1227. 10.3390/s 2303122736772266 PMC 9919698 · doi ↗ · pubmed ↗
- 6Manchado C Tortosa Martínez J Pueo B Cortell Tormo JM Vila H Ferragut C, et al. High-performance handball player’s time-motion analysis by playing positions. Int J Environ Res Public Health. (2020) 17:6768. 10.3390/ijerph 1718676832957441 PMC 7559068 · doi ↗ · pubmed ↗
- 7Hochreiter S Schmidhuber J. Long short-term memory. Neural Comput. (1997) 9:1735–80. 10.1162/neco.1997.9.8.17359377276 · doi ↗ · pubmed ↗
- 8Bergstra J Komer B Eliasmith C Yamins D Cox DD. Hyperopt: a Python library for model selection and hyperparameter optimization. Comput Sci Discov. (2015) 8:014008. 10.1088/1749-4699/8/1/014008 · doi ↗
