Bioinformatics-Inspired IMU Stride Sequence Modeling for Fatigue Detection Using Spectral–Entropy Features and Hybrid AI in Performance Sports
Attila Biró, Levente Kovács, László Szilágyi

TL;DR
This study uses IMUs and AI to detect fatigue in runners by analyzing stride patterns, showing that personalized models work much better than general ones.
Contribution
A bioinformatics-inspired framework combining spectral-entropy features and hybrid AI for personalized fatigue detection using IMUs.
Findings
Personalized Random Forest models achieved 97.7% accuracy in detecting fatigue.
Entropy metrics showed consistent increases in movement irregularity during fatigue.
Population-level models had only 55% accuracy, emphasizing the need for personalization.
Abstract
Wearable inertial measurement units (IMUs) provide an accessible means of monitoring fatigue-related changes in running biomechanics, yet most existing methods rely on limited feature sets, lack personalization, or fail to generalize across individuals. This study introduces a bioinformatics-inspired stride sequence modeling framework that integrates spectral–entropy features, sample entropy, frequency-domain descriptors, and mixed-effects statistical modeling to detect fatigue using a single lumbar-mounted IMU. Nineteen recreational runners completed non-fatigued and fatigued 400 m runs, from which we extracted stride-level features and evaluated (1) population-level fatigue classification via global leave-one-participant-out (LOPO) models and (2) individualized fatigue detection through supervised participant-specific models and non-fatigued-only anomaly detection. Mixed-effects…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1| Parameter | Description |
|---|---|
| Participants | 19 recreational runners, injury-free, regularly active |
| Ethics | Approved by the Human Research Ethics Committee, University College Dublin [ |
| Sensor device | Shimmer3 IMU, manufacured by SENSOR ID Srl, Campochiaro, Italy (lumbar placement at L3–L5) |
| Sampling rate | 256 Hz continuous recording throughout all trial segments |
| Measured signals | Tri-axial accelerometer |
| Derived signals | Acceleration magnitude |
| Experimental conditions |
400 m NF run at comfortable pace Standard beep-test fatiguing protocol 400 m fatigued (F) run post-beep test |
| Beep-test details | 20 m shuttles with decreasing inter-beep intervals; increasing required pace until exhaustion |
| Environment | Outdoor 400 m running track under consistent environmental conditions |
| Stride segmentation | Automated detection of foot-strike events in vertical acceleration;
each stride window time-normalized to fixed length |
| Dataset structure | Two labeled stride sets per participant: NF strides (from first 400 m run) and F strides (from post-fatigue 400 m run) |
| Labels | Binary class per stride: NF or F (fatigued) |
| Total samples | Varies by runner depending on cadence; approximately 50–80 strides per 400 m run, typically 100–160 total strides per participant |
| Model | Input Representation | Mean LOPO Acc. (%) | Std. Dev. (%) |
|---|---|---|---|
| Random Forest (RF) | Engineered features | 54.85 | 14.77 |
| RBF SVM | Engineered features | 55.94 | 15.18 |
| Gradient Boosting (GB) | Engineered features | 56.68 | 15.47 |
| 1D-CNN | Raw strides (8 ch., | 54.39 | 12.86 |
| Participant ID | # Strides | # F | # NF | CV Accuracy | CV F1-Score | CV AUC |
|---|---|---|---|---|---|---|
| 4 | 251 | 130 | 121 | 0.964000 | 0.966864 | 0.997115 |
| 5 | 198 | 98 | 100 | 0.994872 | 0.994872 | 1.000000 |
| 6 | 318 | 164 | 154 | 0.981101 | 0.981814 | 0.998506 |
| 7 | 159 | 75 | 84 | 0.968347 | 0.966422 | 0.997598 |
| 8 | 403 | 201 | 202 | 0.997531 | 0.997468 | 1.000000 |
| 9 | 407 | 221 | 186 | 0.995092 | 0.995506 | 0.999754 |
| 10 | 327 | 177 | 150 | 0.996970 | 0.997260 | 1.000000 |
| 11 | 368 | 207 | 161 | 0.970233 | 0.973141 | 0.995534 |
| 12 | 235 | 118 | 117 | 0.987234 | 0.987219 | 0.998551 |
| 13 | 156 | 100 | 56 | 0.929234 | 0.944859 | 0.984773 |
| 14 | 421 | 222 | 199 | 0.964342 | 0.965771 | 0.995683 |
| 15 | 261 | 124 | 137 | 0.950218 | 0.946664 | 0.994324 |
| 17 | 419 | 211 | 208 | 0.988038 | 0.988233 | 0.998910 |
| 18 | 268 | 104 | 164 | 0.973725 | 0.965086 | 0.996228 |
| 19 | 320 | 189 | 131 | 0.993750 | 0.994805 | 0.999595 |
| 20 | 420 | 209 | 211 | 0.988095 | 0.987834 | 0.999320 |
| 21 | 362 | 176 | 186 | 0.953044 | 0.951615 | 0.993748 |
| 22 | 322 | 169 | 153 | 0.959712 | 0.960859 | 0.997457 |
| 23 | 391 | 185 | 206 | 1.000000 | 1.000000 | 1.000000 |
| Mean across participants | 0.9766 | 0.9772 | 0.9972 | |||
| Acc. | F1 | AUC | |
|---|---|---|---|
| Mean | 97.66% | 0.98 | 1.00 |
| Std. | 1.92% | 0.02 | 0.00 |
| Min | 92.92% | 0.94 | 0.98 |
| Max | 100.00% | 1.00 | 1.00 |
| Participant | # Strides | # NF | # F | Recall on F | FPR on NF |
|---|---|---|---|---|---|
| 4 | 251 | 121 | 130 | 72.31% | 13.22% |
| 5 | 198 | 100 | 98 | 100.00% | 9.00% |
| 6 | 318 | 154 | 164 | 96.95% | 10.39% |
| 7 | 159 | 84 | 75 | 100.00% | 13.10% |
| 8 | 403 | 202 | 201 | 100.00% | 8.91% |
| 9 | 407 | 186 | 221 | 98.64% | 10.75% |
| 10 | 327 | 150 | 177 | 90.96% | 10.00% |
| 11 | 368 | 161 | 207 | 90.82% | 8.70% |
| 12 | 235 | 117 | 118 | 99.15% | 9.40% |
| 13 | 156 | 56 | 100 | 94.00% | 21.43% |
| 14 | 421 | 199 | 222 | 68.02% | 10.05% |
| 15 | 261 | 137 | 124 | 73.39% | 17.52% |
| 17 | 419 | 208 | 211 | 97.16% | 8.65% |
| 18 | 268 | 164 | 104 | 100.00% | 9.15% |
| 19 | 320 | 131 | 189 | 100.00% | 12.21% |
| 20 | 420 | 211 | 209 | 100.00% | 9.95% |
| 21 | 362 | 186 | 176 | 94.32% | 9.14% |
| 22 | 322 | 153 | 169 | 87.57% | 11.11% |
| 23 | 391 | 206 | 185 | 100.00% | 7.77% |
| Recall on F | FPR on NF | |
|---|---|---|
| Mean | 92.80% | 11.08% |
| Std. | 10.06% | 3.28% |
| Min | 68.02% | 7.77% |
| Max | 100.00% | 21.43% |
- —Consolidator Excellence Researcher Program of Obuda University
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Performance and Training · Lower Extremity Biomechanics and Pathologies · Knee injuries and reconstruction techniques
1. Introduction
Fatigue is a key factor influencing running performance [1], movement quality, and injury risk [2]. As fatigue develops, neuromuscular coordination, impact-loading characteristics, and segmental control change [3] in systematic ways that can be monitored using wearable sensors [4]. In particular, inertial measurement units (IMUs) provide a lightweight, low-cost solution for acquiring multi-axis acceleration and gyroscope data in uncontrolled environments, enabling detailed stride-level biomechanical analysis (see Figure 1). Recent advances in wearable technology have therefore motivated the use of IMU-based methods for detecting fatigue during running [1].
Despite the rapid growth of IMU applications in sports biomechanics [5], several limitations persist in current fatigue-detection approaches. Most existing studies rely heavily on simple time-domain metrics such as peak acceleration, root-mean-square (RMS) values, or stride regularity. While these measures capture coarse movement changes, they may fail to reflect subtle alterations in temporal structure, spectral content, or signal irregularity associated with fatigue. Additionally, many machine learning (ML) studies evaluate models within individuals, making it difficult to achieve generalization across runners. Personalized fatigue modeling [6], though critical for athlete-specific monitoring, remains underexplored. Finally, while deep learning has shown promise in gait classification and activity recognition, its integration with interpretable biomechanical features [7] in fatigue detection is limited.
Sports safety approach on specific use case, by Biró et al. [6].
IMU stride sequences (see Figure 2) exhibit structured temporal dynamics, periodicity, and motif-like patterns similar to biological sequences. Motivated by this analogy, the present work adopts a bioinformatics-inspired perspective, treating each stride as a sequence whose spectral and entropy characteristics reflect underlying neuromuscular changes.
From a broader motor control and neuroscience perspective, fatigue-related changes in stride dynamics can be interpreted as alterations in sensorimotor integration and proprioceptive feedback mechanisms. Gait variability and movement complexity have been widely studied as indicators of neuromuscular control, stability, and adaptability, where increased variability or entropy often reflects reduced precision in motor execution or changes in control strategies. Recent literature in biomechanics and neuroscience emphasizes that proprioceptive degradation, altered afferent feedback, and compensatory motor adaptations contribute to increased movement irregularity under fatigue. Accordingly, entropy-based descriptors derived from IMU signals provide a physiologically grounded proxy for assessing fatigue-induced disruptions in sensorimotor control and locomotor stability. This interpretation is consistent with recent systematic evidence linking proprioceptive interventions, movement variability, and postural stability in human locomotion [8].
Frequency-domain decomposition, spectral–entropy, and sample entropy (SampEn) have long been used in genomics and physiological signal analysis to quantify complexity and irregularity. These tools offer a powerful framework for identifying fatigue-driven alterations in stride dynamics [9] that may not be captured by traditional metrics. Inter-individual variability is another major challenge in biomechanics [10] and wearable sensor research [11]. Mixed-effects modeling provides a principled statistical approach for evaluating the fixed effect of fatigue while accounting for random differences between participants. Complementing this statistical framework, hybrid ML approaches—including Random Forests (RFs), Support Vector Machines (SVM), Gradient Boosting (GB), and shallow one-dimensional Convolutional Neural Networks (1D-CNNs)—offer robust classification performance using both handcrafted features and raw stride waveforms. Personalized models, such as supervised subject-specific classifiers and anomaly detectors trained on non-fatigued (NF) data only, further enhance sensitivity to individual fatigue signatures.
Recent advances in wearable sensor research [12,13,14] have demonstrated the potential of IMUs to monitor fatigue-induced changes in running biomechanics [15]. However, most existing studies are based heavily on simple time- and frequency-domain features such as peak acceleration, RMS, stride regularity, or mean vertical loading metrics. These approaches have shown moderate success in distinguishing fatigued from NF conditions [16], yet they often fail to capture the subtle neuromuscular adaptations reflected in the temporal complexity and spectral structure of stride signals. In contrast, the present work employs a bioinformatics-inspired sequence perspective, leveraging spectral–entropy, SampEn, and bandpower metrics to quantify stride complexity. This positions the proposed method beyond conventional feature engineering by explicitly targeting nonlinear and irregular biomechanical signatures of fatigue [17].
Moreover, many previous IMU fatigue studies evaluate classifiers within participants [18,19,20] or use train–test splits that inadvertently allow for subject-specific information to leak across folds, resulting in overestimation of model generalizability. Global models trained using strict leave-one-participant-out (LOPO) validation typically exhibit substantially lower performance, underscoring the challenge posed by inter-individual biomechanical variability. While a few studies acknowledge this variability, they rarely incorporate hierarchical statistical tools to formally separate fixed fatigue effects from random participant effects. The mixed-effects modeling employed in the present work directly addresses this gap, providing a statistically rigorous quantification of fatigue across individuals and enabling effect size interpretation that is often missing in purely machine learning approaches.
Another critical distinction from the existing literature lies in the integration of personalized fatigue modeling. Prior studies have proposed subject-specific classifiers but have generally lacked principled frameworks for anomaly detection or baseline-driven monitoring. By introducing both supervised participant-specific models and NF-only anomaly detectors, this study bridges biomechanical fatigue analysis with computational genomics–inspired deviation modeling. The result is a methodological progression from population-level fatigue detection toward individualized monitoring, which is essential for deployment in real-world sports analytics and digital coaching systems.
Finally, although deep learning architectures such as 1D CNNs and LSTMs have been explored for gait classification and activity recognition, their application to fatigue detection has been limited and often underperforms when inter-individual variability is high [21]. The present study demonstrates that shallow CNNs, although competitive, are outperformed by feature-based personalized models, confirming that domain-informed feature extraction and entropy-based descriptors remain highly effective for fatigue detection. This finding contrasts with the growing tendency to default to deep learning solutions, highlighting the importance of hybrid frameworks that balance interpretability, computational efficiency, and accuracy. Overall, the proposed framework advances the state of the art by combining entropy-driven stride sequence characterization [22], mixed-effects statistical modeling, global LOPO benchmarking, and personalized AI methods—an integration not previously demonstrated in IMU-based fatigue research. This unified approach enables both robust population-level insights and high-fidelity individualized fatigue monitoring, addressing long-standing limitations in generalizability, interpretability, and practical deployment.
Although IMU-based fatigue detection has been widely explored, existing approaches typically rely on simple time-domain features, within-participant evaluation, or black-box deep learning models that offer limited interpretability and poor cross-subject generalization. Prior studies rarely combine entropy-based complexity measures with hierarchical statistical modeling, and no existing work treats IMU stride sequences through a bioinformatics-inspired lens [23] that captures the motif-like structure, spectral complexity, and deviations from an individual’s biomechanical baseline. To address these gaps, this study introduces a unified framework that integrates (1) bioinformatics-inspired sequence modeling; (2) using spectral–entropy, SampEn, and frequency-domain bandpower feature extraction; (3) mixed-effects statistical models that separate population-level fatigue effects from inter-individual variability; and (4) hybrid machine learning models, including global LOPO classifiers and personalized supervised and anomaly-detection approaches. This methodological synergy is unique in the fatigue-detection literature and enables both statistically rigorous population-level inference and high-accuracy personalized monitoring, offering a comprehensive and interpretable approach for IMU-based fatigue assessment in real-world running scenarios [24]. Our results demonstrate a methodology that provides forward-looking expectations for robust IMU-based fatigue detection under controlled experimental conditions.
Contributions of This Work
This study introduces a comprehensive bioinformatics-inspired pipeline [25] for IMU-based fatigue detection in running [26], featuring the following contributions:
- A novel sequence-level representation of IMU strides [27], leveraging spectral–entropy, SampEn, and frequency-domain bandpower features.
- Mixed-effects modeling to quantify the fixed effect of fatigue on stride-level biomechanics [28] in 19 recreational runners.
- A hybrid ML framework including RF, SVM, GB, and shallow 1D-CNNs evaluated using LOPO cross-validation [29].
- Personalized fatigue detection using (1) supervised participant-specific RF models and (2) NF-only anomaly detection with One-Class SVMs.
- An end-to-end, open-source workflow integrating stride segmentation, feature extraction, mixed-effects statistics, global ML models, and personalized AI approaches.
2. Objectives
The primary objective of this study is to develop a comprehensive bioinformatics-inspired analysis pipeline [25] to detect fatigue-induced changes in biomechanics using lumbar-mounted IMU data (see Table 1). Building on recent advances in wearable sensing [30], spectral–entropy analysis, mixed-effects statistics, and hybrid machine learning, this work addresses key limitations in existing IMU-based fatigue research. The specific objectives of the study are as follows:
- To develop a bioinformatics-inspired sequence representation of IMU strides [31,32], treating each stride as a structured, multichannel time series amenable to spectral decomposition, entropy-based complexity analysis, and sequence-level comparisons.
- To extract a comprehensive set of time-, frequency-, and entropy-domain features, including dominant frequency, bandpower, spectral–entropy, and SampEn, that capture fatigue-related alterations in neuromuscular control and stride dynamics.
- To quantify the fixed effect of fatigue on stride biomechanics [33] using mixed-effects modeling, enabling statistically rigorous separation of within-participant and between-participant variability.
- To evaluate population-level fatigue classification models using LOPO cross-validation with Random Forests, Support Vector Machines, Gradient Boosting, and shallow 1D-CNNs trained on either handcrafted features or raw stride sequences.
- To develop personalized fatigue detection models, including (1) supervised participant-specific Random Forest classifiers and (2) NF-only anomaly detection using One-Class SVMs, thereby enabling individualized identification of fatigue signatures.
- To examine the synergy between statistical modeling, biomechanical interpretation, and machine learning, demonstrating how the integration of mixed-effects analysis with global and personalized AI models improves robustness, interpretability, and classification performance.
Together, these objectives support the development of a unified methodological framework combining biomechanical insight, bioinformatics-inspired sequence analysis, and hybrid machine learning for IMU-based fatigue detection in running.
3. Materials and Methods
The modeling and AI experiments were run on the following configurations: Mac Studio, manufactured by Apple Inc. in Cupertino, CA, USA. Apple M1 Ultra, 64 GB Memory on the Google Colab Pro platform [6].
3.1. Participants and Experimental Protocol
Nineteen healthy recreational runners participated in the study (see Table 1). All participants were regularly active, injury-free, and provided informed consent prior to testing. The sample size of nineteen participants is comparable to or exceeds that of many prior IMU-based biomechanics and fatigue studies that rely on intensive within-subject stride-level analysis rather than population inference. Importantly, the present study does not aim to estimate population prevalence or small between-group effects but rather aims to characterize fatigue-induced biomechanical changes at the stride level while explicitly accounting for inter-individual variability using mixed-effects modeling. Each participant contributed approximately 100–160 strides, resulting in over 6000 labeled stride samples, which provides substantial statistical power for detecting within-participant fatigue effects. Mixed-effects models further mitigate limitations of modest participant counts by leveraging repeated measures and hierarchical variance partitioning. Accordingly, while no a priori power analysis was performed, the observed large effect sizes (Cohen’s d up to 1.35) and highly significant fixed effects (all ) indicate that the study is sufficiently powered to support the reported conclusions at the biomechanical and stride sequence level. The study protocol was reviewed and approved by the Human Research Ethics Committee at University College Dublin [34,35]. Data were collected in three consecutive segments performed on an outdoor running track:
- NF run: Each participant completed a 400 m run at a comfortable, self-selected pace.
- Fatiguing protocol (beep test): Participants performed a 20 m shuttle run following auditory cues (“beeps”). The interval between beeps decreased progressively, requiring increasing running speed. The protocol terminated when the participant was unable to maintain the required pace for two consecutive shuttles, which is a standard operational definition of exhaustion in beep-test protocols. While no direct physiological markers such as blood lactate, heart-rate thresholds, or ratings of perceived exertion (RPE) were collected, the beep test is widely used as a validated proxy for inducing substantial metabolic and neuromuscular fatigue. Consequently, the fatigued condition represents a functionally fatigued state rather than a metabolically standardized one, and individual differences in fatigue tolerance are treated as an inherent component of the modeling framework rather than a source of noise.
- Fatigued (F) run: Immediately following the beep test, participants completed another 400 m run at the same comfortable pace, now in a fatigued state.
A Shimmer3 IMU was firmly mounted on the lumbar spine (approximately L3–L5), capturing (1) tri-axial acceleration ; (2) tri-axial angular velocity ; and (3) magnetometer data (not used directly in classification). All experimental procedures, including sensor placement and fatigue protocol execution, were performed by the original data collectors and are reported here as described in the source publication. According to the original dataset documentation, the IMU was mounted on the lower back in the lumbar region (approximately L3–L5). No additional details regarding anatomical landmarking procedures, inter-rater reliability, or placement verification were provided in the original study. As such, potential variability in sensor placement represents a limitation inherited from the dataset rather than from the present analysis. Each step cycle was annotated and segmented into individualized stride windows for analysis. Two additional derived channels were included:
Stride-level segmentation [28] was performed on the IMU data from the two 400 m runs (first in NF state, second in F state). Each stride was assigned a label NF or F according to its run segment.
3.2. Stride Segmentation and Sequence Representation
Signals were preprocessed using a fourth-order Butterworth low-pass filter (20 Hz cutoff). Strides were segmented by detecting consistent peaks in the vertical acceleration, corresponding to foot-strike events. Let denote the multichannel IMU signal at time t, where the channels correspond to
Each stride was then defined over a time-normalized window:
then resampled to a fixed length N points to allow for sequence-level modeling.
3.3. Experimental Environment
For device-validation experiments with DiscoveryMini (see Figure 3) (not used in the fatigue dataset), the sampling frequency was fixed at 30 Hz to ensure adequate temporal resolution while maintaining efficient data acquisition. Data were obtained directly in unprocessed (raw) form to preserve the fidelity of the recorded signals [6,36]. Specifically, acceleration data were captured in both m/s^2^ and mg units, where the sensor-reported gravitational acceleration of Earth corresponds to 9.8 m/s^2^. Real-time data transmission from the sensor to a host computer was achieved via the Bluetooth Low Energy (BLE) communication protocol [36].
3.4. Bioinformatics-Inspired Sequence Modeling
IMU stride sequences exhibit periodic, semi-structured patterns affected by neuromuscular fatigue. These characteristics are analogous to biological sequences, where frequency content, local irregularity, and motif structure provide insights into underlying biological processes. Motivated by this connection, we apply analysis tools adapted from computational genomics:
- Spectral decomposition [37] (analogous to Fourier-based genomic periodicity analysis) to quantify periodicity shifts under fatigue.
- Entropy-based complexity metrics (like sequence entropy and complexity metrics) to detect irregularity and loss of neuromuscular control [38].
- Anomaly detection analogous to identifying deviations or “mutations” from a participant’s normal biomechanical pattern, where fatigued strides deviate from the individual’s NF biomechanics.
This bioinformatics perspective enhances the sensitivity of IMU data analysis to subtle stride-level changes induced by fatigue or, in other words, the stride dynamics beyond traditional biomechanics [39]. While the proposed framework does not implement classical sequence-alignment algorithms (e.g., k-mer matching or homology scoring), the term bioinformatics-inspired is used to emphasize the conceptual transfer of sequence analysis principles—including entropy, spectral complexity, motif-like structure, and deviation detection—from genomics and physiological signal analysis to biomechanical stride sequences.
3.5. Time-Domain Features
For each stride and channel, we extracted classical biomechanical features including the following:
Root-Mean-Square (RMS):
Mean, variance, skewness, and kurtosis:
3.6. Spectral and Frequency-Domain Features
Using the one-sided Fast Fourier Transform (FFT), the magnitude spectrum is
Power spectral density (PSD):
Dominant frequency:
Spectral–entropy:
Bandpower features were computed over physiologically meaningful frequency band :
3.7. Sample Entropy and Complexity Features
Sample entropy ( ) quantifies stride irregularity:
where B is the number of matched sequences of length m, and A is the number of matched sequences of length , under tolerance r. Fatigue is hypothesized to increase signal irregularity, leading to a higher .
3.8. Stride-to-Stride Variability [40]
For a feature extracted across strides of one participant,
and the complexity of the stride-to-stride feature trajectory [32] is quantified using SampEn of the sequence . The trajectory SampEn captures higher-level fatigue trends. These metrics capture macro-level fatigue trends that complement micro-level stride features.
3.9. Mixed-Effects Modeling
To quantify the fixed effect of fatigue while accounting for participant-level variability, we employ linear mixed-effects models of the following form:
with participant-specific random intercepts , where (1) is a feature for stride i of participant j; (2) captures the systematic fatigue effect; (3) captures individual differences; and (4) . This hierarchical structure accounts for inter-individual biomechanics variability. Having established the biomechanical validity of the extracted fatigue-related features [1], we next assess their predictive utility in both global and personalized machine learning frameworks. Having identified robust biomechanical fatigue signatures, we now evaluate whether these features enable accurate classification under population-level and personalized ML paradigms.
3.10. Global Machine Learning Models
Several global models were evaluated, trained on pooled data using leave-one-participant-out (LOPO) cross-validation: (1) Random Forest (RF); (2) Support Vector Machine (SVM, RBF kernel); (3) Gradient Boosting (GB); and (4) shallow 1D-CNN for raw stride sequences. Each model receives either (1) handcrafted spectral–entropy features or (2) raw stride sequences . Performance metrics include accuracy, F1-score, and AUC.
3.11. Personalized Machine Learning Models
In fatigue detection, “anomalies” (outliers) correspond to fatigued strides. This approach is analogous to mutation or deviation detection in bioinformatics [41]. Two types of personalized models are developed: (1) Supervised personalized RF: A Random Forest is trained and cross-validated on each participant’s NF and F strides:
(2) NF-only anomaly detection: One-Class SVM trained on NF strides identifies fatigued strides as anomalies.
3.12. Synergy of Methods and Novelty of the Framework
The novelty of this work lies in the integration of (1) bioinformatics-inspired sequence modeling, treating strides as structured sequences with motif-like features; (2) frequency-domain and entropy-based complexity metrics; (3) hierarchical mixed-effects modeling to quantify fatigue effects; (4) hybrid ML models combining interpretable features with raw deep learning representations; and (5) personalized anomaly detection grounded in within-runner biomechanics. The result is a comprehensive framework for IMU-based fatigue detection that combines statistical interpretability, biomechanical validity, and machine learning performance.
4. Preprocessing
Table 2 shows that fatigue has a strong positive fixed effect on acceleration magnitude kurtosis, indicating sharper impact peaks and a more peaked distribution of acceleration under fatigue. The large z-score and p < 0.001 confirm this effect is highly significant. Random between-participant variance is modest but non-negligible.
The mean vertical acceleration (see Table 3) decreases significantly under fatigue (coef = −0.842; p < 0.001), indicating reduced vertical propulsion or altered body posture. This is one of the largest effect sizes among the features, reflecting a robust biomechanical adaptation.
Fatigue leads to a significant increase in lateral acceleration peaks (see Table 4), suggesting greater mediolateral trunk instability or reduced control. The large random variance indicates substantial participant-specific differences, consistent with individual running styles.
The gyroscope RMS in the lateral axis increases significantly under fatigue (see Table 5), reflecting higher rotational variability around the lumbar spine. This likely corresponds to reduced trunk stabilization and increased compensatory movement patterns.
Table 6 highlights the fatigue exerted a statistically significant fixed effect on all evaluated IMU features ( ). The strongest negative shift appeared in acc_z_mean, indicating reduced vertical acceleration during fatigued running. Large positive fatigue effects on acc_y_max and gyro_y_rms reflect increased mediolateral trunk motion and rotational variability, consistent with diminished neuromuscular control. The increase in acc_mag_kurt suggests sharper, more impulsive loading patterns under fatigue. Together, these findings highlight multidimensional changes in impact mechanics, stability, and rotational control during fatigued running.
Figure 4 provides a compact overview of the direction and magnitude of fatigue effects, while Figure 5 presents the corresponding confidence intervals and statistical uncertainty. The vertical acceleration mean (acc_z_mean) shows a strong negative shift under fatigue, while acc_mag_kurt, acc_y_max and gyro_y_rms exhibit positive fatigue effects, indicating sharper impact peaks, higher mediolateral acceleration, and greater lateral trunk rotation, respectively. All confidence intervals lie well away from zero, confirming the robustness of these fatigue-induced changes.
4.1. Effect Size Analysis for Mixed-Effects Models (Cohen’s d and Partial R2)
In addition to estimating fixed fatigue effects using mixed-effects models, we computed standardized effect size measures to quantify the magnitude of fatigue-induced changes in stride-level IMU features. Two families of effect sizes were evaluated: (1) standardized mean difference (Cohen’s d), and (2) variance-explained measures tailored to mixed-effects models, including the marginal , conditional , and partial of the fatigue effect.
4.1.1. Cohen’s d
For mixed-effects models, Cohen’s d was computed using the estimated fixed effect and the residual variance :
where is the square root of the model’s residual variance. This formulation provides a standardized effect size comparable to traditional between-group comparisons.
4.1.2. Marginal and Conditional R2
Following Nakagawa and Schielzeth [42], we computed
where and captures participant-level variance.
4.1.3. Partial R2 for the Fatigue Effect
To quantify the unique variance explained by fatigue, we used the following method [42]:
where t is the Wald z-statistic of the fatigue coefficient and approximates the effective residual degrees of freedom. This measure reflects the proportion of explainable variance attributable specifically to fatigue, controlling for random intercepts across participants.
These standardized effect sizes (see Table 7) indicate that fatigue-induced biomechanical changes are not only statistically significant but also practically meaningful, supporting their use as discriminative biomarkers in subsequent classification tasks [43]. The effect size analysis confirms that fatigue produces strong and meaningful alterations in stride biomechanics. Cohen’s d values ranged from moderate ( for gyro_y_rms) to very large ( for acc_z_mean), indicating substantial standardized separation between fatigued and NF stride distributions. Partial values (0.12–0.31) further demonstrate that fatigue explains a non-trivial proportion of variance in IMU features even after accounting for participant-level random effects. These results reinforce the robustness of the fatigue signatures detected by the mixed-effects models and justify the use of these features in downstream machine learning pipelines.
As shown in Figure 6, Cohen’s d values for the fixed effect of fatigue were large across all four representative features, with the strongest effect for acc_z_mean. The corresponding partial values (Figure 7) indicate that fatigue alone explains between 12% and 31% of the variance in these features, even after accounting for between-participant differences.
4.2. Signal Acquisition Data Validation
A secondary IMU prototype (DiscoveryMini) was used only to validate signal acquisition in parallel experiments; it was not used in the fatigue dataset. For modeling and validation purposes, this study initially used the ActiGraph GT9X wireless inertial sensor [25], followed by an ultralow-power, high-performance, three-axis linear accelerometer [44] (DiscoveryMini, incorporating the LIS2DH12) equipped with integrated EEPROM memory (see Figure 3). The accelerometer featured notification modalities, including an acoustic buzzer and vibration alerts. The device was powered by a 400 milliampere-hour (mAh) battery. The DiscoveryMini inertial measurement unit (IMU) supports a sampling frequency range from 1 Hz to 5.3 kHz and is capable of recording tri-axial acceleration (X, Y, and Z axes) with a dynamic range of up to ±16 g. This device was used for validation of the conducted research experiments.
5. Results
Global LOPO performance is modest, confirming that stride biomechanics vary strongly between runners and that a single population-level model struggles to generalize. In contrast, supervised personalized RF models achieve very high accuracy, F1, and AUC, indicating that fatigue patterns are highly consistent within individuals. Personalized NF-only anomaly detection also performs well, achieving strong fatigue recall with a manageable false positive rate. These results highlight the importance of individualized modeling [45] for reliable fatigue detection in wearable sensor biomechanics [46].
The dataset contains 6006 strides (2926 NF; 3080 F) described by 64 IMU-derived features (see Table 8). LOPO cross-validation reveals large variability in generalization across participants, with accuracies ranging from 0.35 to 0.89 and a mean accuracy of 0.5487. Given this substantial inter-individual variability, we next examine whether participant-specific classifiers and NF-only anomaly detectors provide more reliable and individualized fatigue detection. Although the total number of NF and fatigued strides is balanced at the dataset level, the number of strides per condition varies across participants due to differences in cadence, running speed, and fatigue tolerance. This imbalance reflects realistic inter-individual variability rather than experimental bias. To address this, mixed-effects models explicitly account for unequal sample sizes through hierarchical structure, and machine learning evaluations emphasize per-participant metrics and personalized modeling approaches rather than pooled accuracy alone.
The top 20 feature importance scores (see Table 9) show that fatigue is primarily reflected in vertical acceleration (acc_z_mean), signal magnitude metrics, kurtosis/skewness, and gyroscope-based energy measures—indicators of impact mechanics, stabilization, and rotational control. These findings (see Figure 8) justify personalized or hybrid modeling approaches in subsequent analyses (see Figure 9).
The NF-only personalized anomaly detectors achieve consistently high performance across most participants (see Figure 10), with a mean accuracy of 0.9061, mean F1-score of 0.9058, and mean AUC of 0.9670. Several participants reach near-perfect AUC values (1.0), indicating that their fatigued strides are clearly separable from their NF baseline. Performance is slightly lower for a few individuals (e.g., IDs 4, 14, and 15), suggesting more subtle or variable fatigue signatures. Overall, these results confirm that modeling fatigue as a deviation from individual NF biomechanics is highly effective. Receiver operating characteristic (ROC) curves were computed during model development to determine decision thresholds; however, due to space constraints and the focus on per-participant summary statistics, only the AUC, recall, and false positive rates are reported. Future extensions of this work will include participant-specific ROC visualizations and threshold optimization strategies for deployment-specific trade-offs.
Table 10 corroborates that globally trained models exhibit limited generalizability, primarily due to pronounced inter-individual variability. Consequently, we implemented a supervised, personalized modeling approach: a per-participant supervised Random Forest with stratified K-fold cross-validation. Specifically, for each participant, a Random Forest classifier is trained exclusively on that runner’s own NF and fatigued (F) strides, with model evaluation performed via within-participant stratified K-fold cross-validation.
The supervised personalized RF models (see Table 11) exhibit consistently excellent performance across participants (see Table 12), with a mean cross-validated accuracy of 0.9766, mean F1-score of 0.9772, and mean AUC of 0.9972 (see Table 13). For most runners, the classifier nearly perfectly separates NF and fatigued strides when trained exclusively on that individual’s data. Slightly lower performance for a few participants (e.g., IDs 13, 15, and 21) suggests more subtle or variable fatigue patterns, but overall the results indicate that within-participant learning captures highly stable and discriminative fatigue signatures. Below is listed the NF-only anomaly detection (nf_personal_df) (see Table 14 with Summary Table 15).
For participant 4, (see Table 16) fatigue increases stride-to-stride variability (see Table 17 and Figure 12) in both the acceleration and gyroscope magnitude RMS signals, as reflected in positive CV for acc_mag_rms and gyro_mag_rms. SampEn increases markedly for acc_mag_rms, indicating more irregular stride-to-stride fluctuations in overall acceleration magnitude under fatigue. In contrast, gyro_mag_rms shows a small decrease in SampEn, suggesting that rotational control becomes more variable in amplitude (higher CV) but not necessarily more irregular in temporal structure. This pattern illustrates that fatigue affects both variability and complexity of trunk motion in a feature-specific way.
Stride-to-stride variability (see Table 16 and Table 17) reflects the stability and regularity of locomotion and is known to change under fatigue [48]. Across participants, fatigued strides generally exhibited higher variance and coefficient of variation (CV) in both acc_mag_rms and gyro_mag_rms, indicating less stable trunk acceleration and rotational control. SampEn, which quantifies temporal irregularity of feature trajectories, also tended to increase under fatigue for acc_mag_rms, suggesting less predictable stride-to-stride structure. Gyroscope-based SampEn changes were more participant-specific, consistent with individual strategies of compensatory stabilization during fatigue. These findings align with biomechanical theory: fatigue degrades neuromuscular control, leading to more variable and less regular stride patterns.
6. Discussion
This study introduced a unified, bioinformatics-inspired framework for IMU-based fatigue detection that integrates spectral–entropy features, mixed-effects modeling, and hybrid machine learning approaches. The primary contribution of this study lies in identifying robust, individualized fatigue signatures rather than establishing population-wide classification models applicable without personalization. By combining population-level inference with individual-specific modeling, the results provide new insight into how fatigue alters lumbar-mounted IMU stride signatures and demonstrate why personalized methods are critical for robust wearables-based fatigue monitoring [49].
6.1. Population-Level Fatigue Signatures Revealed by Mixed-Effects Modeling
Mixed-effects analyses revealed strong and multidimensional fatigue effects across all evaluated IMU-derived features. The largest standardized shifts were observed in the vertical acceleration mean, mediolateral acceleration peaks, gyroscope-derived RMS values, and impact kurtosis, indicating reduced vertical propulsion, diminished trunk stability, and sharper impact-loading patterns under fatigue. The confidence intervals in the forest plots were well separated from zero, confirming that these effects were consistent across runners despite substantial biomechanical heterogeneity. The inclusion of standardized effect sizes (Cohen’s d and partial ) further strengthened interpretability, showing that fatigue explained up to ∼31% of the variance in key features (e.g., acc_z_mean). These findings demonstrate that IMU stride sequences encode statistically meaningful fatigue signatures that are both large in magnitude and robust across individuals.
6.2. Limited Generalization of Global Models Highlights Inter-Individual Variability
In contrast with the clear population-level effects, global LOPO models achieved modest and highly variable accuracy across participants. This outcome is consistent with prior reports on wearable sensor biomechanics and confirms that stride-level running patterns exhibit strong inter-individual specificity. Even with 64 carefully engineered spectral–entropy and time-domain features, global classifiers struggled to generalize to unseen runners. This discrepancy between strong statistical fatigue effects and weak LOPO classification performance underscores a key insight: population-level biomechanical signatures do not translate directly into population-level predictive models. Individualized neuromuscular strategies, anatomical differences, posture, and running styles collectively limit the transferability of global fatigue models.
6.3. Personalized Supervised Models Achieve Near-Perfect Performance
When models were trained within participants, supervised personalized Random Forest classifiers achieved exceptionally high performance (mean accuracy ; mean AUC ). This near-perfect separability confirms that fatigue alters stride biomechanics in consistent and individualized ways that are not preserved under cross-participant pooling. These results show that personalized calibration is essential for applications requiring high reliability, such as injury-risk monitoring [50], training-load management, or athlete-specific coaching systems. They also highlight the limitations of black-box deep learning approaches that do not incorporate individualized structure or interpretable biomechanical features.
6.4. NF-Only Personalized Anomaly Detection Supports Minimal Calibration Workflows
The NF-only One-Class SVM models demonstrated that fatigued strides can be reliably detected as deviations from a runner’s NF baseline, even without any fatigued training data. With a mean AUC of , this method enables real-world deployments where collecting labeled fatigued data is impractical. Conceptually, this aligns with bioinformatics-style anomaly detection, where deviations from a canonical sequence represent functional alterations. Here, fatigued movement patterns behave as “deviations” of an individual’s normal stride structure. This paradigm offers a lightweight, interpretable, and calibration-efficient alternative to supervised learning.
6.5. Entropy and Variability Metrics Capture Neuromuscular Degradation
Entropy and stride-to-stride variability analyses provided additional mechanistic insight into fatigue-induced changes. Fatigue generally increased variance, coefficient of variation, and SampEn, particularly for acceleration magnitude features. These findings are consistent with fatigue-induced reductions in rhythmic stability and neuromuscular control. The feature-specific differences—such as stronger entropy increases in acceleration-derived metrics than in gyroscope-derived metrics—suggest that fatigue does not affect all neuromuscular pathways equally. Instead, runners adopt individualized compensatory strategies, as reflected in the heterogeneity observed in the gyroscope-based entropy changes. From a neuromuscular control standpoint, the observed increase in entropy with fatigue likely reflects a shift toward less predictable motor output. As fatigue progresses, altered motor unit recruitment, delayed afferent feedback, and reduced synchronization between agonist and stabilizing muscles may lead to noisier and less regular stride patterns. Such changes are consistent with motor control variability theories, which suggest that fatigue constrains the nervous system’s ability to maintain stable and repeatable movement trajectories. In this context, higher sample entropy indicates a loss of temporal structure and diminished fine-grained control rather than random noise, providing a physiologically meaningful marker of fatigue-induced neuromuscular degradation. These observations further support the interpretation of entropy-based IMU features as surrogate markers of sensorimotor adaptability rather than purely statistical descriptors and align with broader evidence that proprioceptive alterations and sensorimotor adaptation play a central role in fatigue-related movement variability [8].
6.6. A Bioinformatics-Inspired Perspective Enhances Sensitivity and Interpretability
The fatigue detection is conceptually analogous to deviation or mutation detection in biological sequences, where departures from an individual baseline carry more discriminative information than absolute population-level patterns. Treating stride windows as structured sequences enabled the application of spectral–entropy, SampEn, and bandpower measures typically used in genomics and physiological-signal analysis. This perspective allowed for the detection of motif-like changes and nonlinear irregularities that are not captured by traditional biomechanical metrics. The integration of multilevel modeling, interpretable spectral–entropy features, and personalized ML represents a methodological advancement over existing IMU fatigue studies, which often rely on narrow feature sets, within-subject evaluation only, or black-box deep networks with limited interpretability.
6.7. Practical Implications for Wearable Monitoring and Sports Performance
The results demonstrate that a single lumbar-mounted IMU can provide a rich representation of fatigue-related changes in trunk biomechanics. Global models offer statistically meaningful insight into population-level movement adaptations, while personalized models—especially minimal-calibration NF-only detectors—are best suited for real-time athlete monitoring, coaching feedback, digital twins, and adaptive training systems. Altogether, the proposed framework establishes a scalable and statistically grounded pathway from biomechanical interpretability to deployable, personalized AI systems, offering a scalable blueprint for next-generation wearable fatigue analytics.
7. Limitations and Future Work
All experimental design choices and data acquisition procedures reflect those of the original dataset and were not modified or extended in the present analysis. Despite the methodological breadth and strong empirical results, several limitations must be acknowledged to contextualize the findings and guide future research. First, the study relies on a single dataset collected from nineteen recreational runners using a lumbar-mounted IMU. Although mixed-effects modeling confirmed robust fatigue effects across participants, the limited sample size and homogeneous cohort restrict the generalizability of the global models. Future work should validate the proposed framework on larger and more diverse populations, including elite athletes, sport teams [51], different age groups, and clinical populations with altered gait patterns. Second, while the present study focuses on a single sensor location, fatigue manifests through multi-joint adaptations that may be more comprehensively captured using multi-IMU configurations or complementary sensing modalities (e.g., EMG; footstrike pressure insoles). Evaluating how multi-sensor fusion improves entropy-based or sequence-level modeling represents an important next step. Third, the beep-test protocol induces substantial metabolic and neuromuscular fatigue, but it captures only one dimension of athlete fatigue. Additional protocols—such as prolonged submaximal running, interval-based fatigue, or cognitive–motor dual-task fatigue—should be investigated to evaluate whether the entropy- and variability-based signatures generalize across fatigue types. Fourth, global machine learning models demonstrated limited generalization (LOPO accuracy ), highlighting pronounced inter-individual variability in running biomechanics. Although personalized classifiers and NF-only anomaly detection provided excellent performance, they require participant-specific data for calibration. Developing hybrid transfer-learning approaches, personalized model initialization, or meta-learning strategies may reduce the dependence on intensive per-subject calibration. Finally, entropy, bandpower, and stride-to-stride variability metrics were extracted using fixed parameter choices (e.g., SampEn parameters; FFT windowing). Adaptive or data-driven parameter optimization could further improve sensitivity to fatigue-induced micro-dynamics, and real-time implementations must consider computational efficiency and hardware constraints. Overall, future work will focus on validating the proposed framework across broader populations, evaluating alternative fatigue paradigms, integrating multimodal wearable data, and developing adaptive or real-time personalized fatigue-monitoring models suitable for embedded or edge-computing systems. Several of the methodological limitations identified in this study are inherited from the publicly available dataset used for analysis. Future extensions of this research will involve the collection of new experimental data using a fully standardized acquisition protocol designed to reduce sources of measurement uncertainty identified in the present dataset. In particular, sensor placement will be explicitly standardized by positioning a single IMU centrally over the lumbar spine, aligned with the midline at approximately the L3–L5 vertebral level using palpable anatomical landmarks (e.g., iliac crest line), and secured with an elastic belt to minimize relative motion. All sensors will be mounted by the same trained experimenter following a consistent protocol, enabling assessment of inter-session and inter-rater reliability. Such a protocol will allow for systematic evaluation of placement variability effects and further strengthen the robustness and reproducibility of entropy- and variability-based fatigue detection in real-world running scenarios.
Ethical Considerations
This study was conducted in full compliance with institutional, national, and international ethical standards for research involving human participants. The experimental protocol, including participant recruitment, data collection procedures, and IMU instrumentation, was reviewed and approved by the Human Research Ethics Committee at University College Dublin (UCD), ensuring adherence to the principles outlined in the Declaration of Helsinki. All participants were recreational runners, injury-free at the time of recruitment, and engaged voluntarily in the study procedures. The methodological contributions of this study may eventually inform real-time athlete monitoring, rehabilitation support, or occupational fatigue detection. While these applications have potential benefits, careful consideration is required to ensure that such systems are not used for surveillance, coercion, or performance pressure without adequate consent and oversight. Ethical deployment of wearable-based fatigue analytics should always prioritize the autonomy, well-being, and privacy of individuals. Overall, this research complied with all relevant ethical standards and demonstrates the responsible collection, processing, and interpretation of human sensor data within a scientific context.
8. Conclusions
This study introduced a unified, bioinformatics-inspired framework for IMU-based fatigue detection that combines mixed-effects statistical modeling, entropy- and spectral-based sequence characterization, stride-to-stride variability metrics, and hybrid machine learning approaches. Using a single lumbar-mounted IMU, we demonstrated that fatigue induces consistent, multidimensional, and statistically robust alterations in trunk acceleration and rotational dynamics across runners. Effect size analyses, including partial values ranging from 0.12 to 0.31, confirmed that fatigue explains a substantial proportion of variance in key biomechanical features, while increases in variability and SampEn highlight diminished neuromuscular control under fatigued conditions. Global population-level classifiers evaluated with LOPO cross-validation achieved only moderate accuracy, underscoring the considerable inter-individual variability in running biomechanics and fatigue responses. In contrast, personalized learning approaches—both supervised Random Forest models and NF-only anomaly detectors—provided highly accurate within-athlete fatigue classification, frequently approaching perfect discrimination (AUC ≈ 1.0). These findings confirm that fatigue signatures are strongly individual-specific and that personalized calibration is essential for reliable monitoring in real-world environments. By integrating entropy, spectral features, and variability metrics, the proposed framework captures subtle biomechanical adaptations that are often invisible to traditional time-domain measures alone. From a deployment perspective, the bioinformatics-inspired framework is well suited for real-time wearable applications. The extracted spectral–entropy and variability features are computationally lightweight and can be computed online on embedded processors or mobile devices. Personalized calibration strategies, such as NF-only baseline learning or short supervised warm-up sessions, enable rapid individual adaptation without requiring extensive labeled fatigue data. These properties support practical integration into field-based monitoring systems, digital coaching platforms, and athlete-centric decision support tools operating in uncontrolled training and competition environments. Prospective extensions may integrate multimodal sensor data, adaptive online learning strategies, and digital-twin paradigms to support real-time, personalized fatigue analytics and to expand the generalizability and deployment of this framework across heterogeneous movement tasks and environments. All proposed feature extraction and classification steps are computationally lightweight and suitable for real-time or near-real-time execution on embedded or mobile platforms.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Olaya-Cuartero J. Lopez-Arbues B. Jiménez-Olmedo J. Villalón-Gasch L. Influence of fatigue on the modification of biomechanical parameters in endurance running: A systematic review Int. J. Exerc. Sci.2024171377139110.70252/lllt 329339807294 PMC 11728577 · doi ↗ · pubmed ↗
- 2Rasmussen M. Laumann K. The evaluation of fatigue as a performance shaping factor in the petro-hra method Reliab. Eng. Syst. Saf.202019410618710.1016/j.ress.2018.06.015 · doi ↗
- 3Lam H. Middleton H. Phillips S. The effect of self-selected music on endurance running capacity and performance in a mentally fatigued state J. Hum. Sport Exerc.20211789490810.14198/jhse.2022.174.16 · doi ↗
- 4Marotta L. Buurke J. Beijnum B. Reenalda J. Towards machine learning-based detection of running-induced fatigue in real-world scenarios: Evaluation of imu sensor configurations to reduce intrusiveness Sensors 202121345110.3390/s 2110345134063478 PMC 8156769 · doi ↗ · pubmed ↗
- 5Calle-Jaramillo G. Palacio E. Rojas-Jaramillo A. González-Jurado J. Effects of fatigue induced by the running-based anaerobic sprint test on the performance in execution time and decision-making in technical-tactical actions in soccer (passing and driving) in a laboratory situation Teorìâ Ta Metod. Fìzičnogo Vihovannâ20232376276910.17309/tmfv.2023.5.15 · doi ↗
- 6BiróA. Cuesta-Vargas A.I. Szilágyi L. AI-Assisted Fatigue and Stamina Control for Performance Sports on IMU-Generated Multivariate Times Series Datasets Sensors 20242413210.3390/s 2401013238202992 PMC 10781393 · doi ↗ · pubmed ↗
- 7Radzak K. Stickley C. Fatigue-induced hip-abductor weakness and changes in biomechanical risk factors for running-related injuries J. Athl. Train.2020551270127610.4085/1062-6050-531-1932946577 PMC 7740065 · doi ↗ · pubmed ↗
- 8Ghai S. Ghai I. Narciss S. Influence of taping on joint proprioception: A systematic review with between and within group meta-analysis BMC Musculoskelet. Disord.20242548010.1186/s 12891-024-07571-238890668 PMC 11186105 · doi ↗ · pubmed ↗
