# Longitudinal modeling of Post-COVID-19 condition over three years: A machine learning approach using clinical, neuropsychological, and fluid markers

**Authors:** Julia Walders, Sophie Wetz, Ana Sofia Costa, Anna Hofmann, Jörg B. Schulz, Kathrin Reetz, Ravi Dadsena

PMC · DOI: 10.1038/s41598-026-37635-3 · Scientific Reports · 2026-02-14

## TL;DR

This study uses machine learning to track and classify the health status of post-COVID-19 patients over three years, identifying key markers for monitoring and managing their condition.

## Contribution

The study introduces a machine learning framework for longitudinal classification of post-COVID-19 condition stages using clinical, neuropsychological, and fluid markers.

## Key findings

- Gradient boosting methods achieved F1-scores close to or above 90% in classifying patient health stages.
- Inflammatory markers were the most informative predictors for distinguishing follow-up stages.
- Classification performance improved with longer time intervals between visits, indicating diverging patient phenotypes over time.

## Abstract

Post-COVID-19 condition (PCC) manifests with prolonged, heterogeneous symptoms challenging both, diagnosis and therapeutic management. This three-year longitudinal study analyzed data from 93 adults (mean age of 48.9 ± 14.0, 60 female) after confirmed SARS-CoV-2 infection. Every follow-up visit included clinical, neuropsychological, and laboratory assessments, capturing multidimensional indicators of patient health. A machine learning framework was implemented to classify temporal stage of patient health status, identify visit-specific predictive markers, and manage incomplete data using both native handling in tree-based models and explicit imputation techniques. Gradient boosting methods consistently achieved the best performance across all visit comparisons, achieving F1-scores close to or above 90%. Classification performance improved with greater time intervals between visits, suggesting progressive divergence in patient phenotypes over time. For discriminating follow-up stages, inflammatory markers emerged as the most informative predictors, followed by SARS-CoV-2 antibody levels and neuropsychiatric measures for fatigue and cognitive performance. Interpretability analyses using SHAP and LIME confirmed the contribution of these features, while revealing shifts in feature relevance across years. These findings highlight the utility of machine learning in characterizing follow-up stage separability in PCC and offer clinically interpretable insights that prioritize immune and neuropsychological measures for monitoring and risk-stratified follow-up.

The online version contains supplementary material available at 10.1038/s41598-026-37635-3.

## Full-text entities

- **Genes:** IL2 (interleukin 2) [NCBI Gene 3558] {aka IL-2, TCGF, lymphokine}, GFAP (glial fibrillary acidic protein) [NCBI Gene 2670] {aka ALXDRD}, CRP (C-reactive protein) [NCBI Gene 1401] {aka PTX1}, IL6 (interleukin 6) [NCBI Gene 3569] {aka BSF-2, BSF2, CDF, HGF, HSF, IFN-beta-2}, NEFL (neurofilament light chain) [NCBI Gene 4747] {aka CMT1F, CMT2E, CMTDIG, NF-L, NF68, NFL}, CXCL8 (C-X-C motif chemokine ligand 8) [NCBI Gene 3576] {aka GCP-1, GCP1, IL8, LECT, LUCT, LYNAP}, S (surface glycoprotein) [NCBI Gene 43740568] {aka spike glycoprotein}, IL10 (interleukin 10) [NCBI Gene 3586] {aka CSIF, GVHDS, IL-10, IL10A, TGIF}, ITIH2 (inter-alpha-trypsin inhibitor heavy chain 2) [NCBI Gene 3698] {aka H2P, ITI-HC2, SHAP}
- **Diseases:** cognition (MESH:D003072), PCC (MESH:D000094024), impairments in logical reasoning, executive functions, and verbal memory (MESH:C537032), neuronal injury (MESH:D009410), TIA (MESH:D002546), depression (MESH:D003866), infection (MESH:D007239), COVID-19 (MESH:D000086382), neuropsychiatric (MESH:C000631768), death (MESH:D003643), excessive daytime sleepiness (MESH:D006970), T cell dysregulation (MESH:D016399), smell disturbances (MESH:D000857), chronic fatigue (MESH:D015673), PTSD (MESH:D013313), Fatigue (MESH:D005221), COPD (MESH:D029424), overweight (MESH:D050177), D (MESH:D014808), Anxiety and Depression (MESH:D001007), Neuroinflammatory (MESH:D000090862), neuropsychiatric symptoms (MESH:D001523), neuropsychological impairment (MESH:D060825), PNP (MESH:D011115), daytime sleepiness (MESH:D012893), Inflammatory (MESH:D007249)
- **Species:** Homo sapiens (human, species) [taxon 9606], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12909961/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12909961/full.md

## References

2 references — full list in the complete paper: https://tomesphere.com/paper/PMC12909961/full.md

---
Source: https://tomesphere.com/paper/PMC12909961