An unsupervised decision-support framework for multivariate biomarker analysis in athlete monitoring
Fernando Barcelos Rosito, Sebasti\~ao De Jesus Menezes, Simone Ferreira Sturza, Adriana Seixas, Muriel Figueredo Franco

TL;DR
This paper introduces an unsupervised multivariate framework for athlete biomarker analysis that identifies latent physiological states, improving interpretability and detection of silent risks without injury labels.
Contribution
It presents a novel modular computational approach combining clustering and stability analysis to interpret multivariate biomarker data in athlete monitoring.
Findings
Identifies physiological profiles distinguishing damage types.
Detects silent risk phenotypes missed by univariate methods.
Demonstrates robustness in high-dimensional and augmented data scenarios.
Abstract
Purpose. Athlete monitoring is constrained by small cohorts, heterogeneous biomarker scales, limited feasibility of repeated sampling, and the lack of reliable injury ground truth. These limitations reduce the interpretability and utility of traditional univariate and binary risk models. This study addresses these challenges by proposing an unsupervised multivariate framework to identify latent physiological states in athletes using real data. Methods. We propose a modular computational framework that operates in the joint biomarker space, integrating preprocessing, clinical safety screening, unsupervised clustering, and centroid-based physiological interpretation. Profiles are learned exclusively from amateur soccer players during a competitive microcycle. Synthetic data augmentation evaluates robustness and scalability. Ward hierarchical clustering supports monitoring and etiological…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
