# Distinguishing post-COVID from long-COVID in adults: Development and validation of a biomarker signature using targeted proteomics and machine learning in a cross-sectional observational study

**Authors:** Franziska Meyer, Stephan Traidl, Milad Ameri, Anita Dreher, Nevine Abu-Rashed-Kufs, Jan Vontobel, Matthias Möhrenschlager, Hans-Werner Duchna, Felicia Sandberg, Marie-Charlotte Brüggen

PMC · DOI: 10.1371/journal.pone.0338451 · PLOS One · 2026-02-27

## TL;DR

This study identifies a biomarker signature to distinguish long-COVID from post-COVID recovery using proteomics and machine learning.

## Contribution

A novel biomarker signature using proteomics and machine learning to differentiate long-COVID from post-COVID recovery.

## Key findings

- LC patients showed distinct clinical and proteomic profiles compared to PC patients.
- A Random Forest classifier achieved 89% accuracy using LAMP3, CKAP4, and KRT19 as biomarkers.
- LC patients had higher fatigue and neurocognitive symptoms, while PC patients had greater pulmonary impairment.

## Abstract

COVID-19 can have diverse clinical manifestations, ranging from asymptomatic infection to critical illness with multiorgan involvement. While many patients recover fully, others develop long-COVID, a heterogeneous condition marked by persistent symptoms beyond the acute phase. The immunological pathomechanisms between long-COVID and other post-acute recovery states remain unclear.

To characterize and compare clinical, pulmonary, and proteomic profiles of patients with long-COVID (LC) and those recovering from severe COVID-19 without long-COVID (post-severe-COVID, PC), and to evaluate the predictive potential of machine learning–based biomarker analysis.

In this monocentric, prospective observational study with a cross-sectional design, patients undergoing rehabilitation were included at admission. Clinical data, detailed symptom profiles, and lung function testing, including diffusing capacity of the lungs, were collected. Serum proteomics covering immune response and inflammation panels was performed, and a Random Forest classifier was applied to identify biomarkers differentiating LC and PC.

LC (n = 24) patients were younger (52 years vs. 58 years in PC), predominantly female (66.7% vs. 30.0% in PC), and reported fatigue, neurocognitive symptoms, and exercise intolerance, whereas PC (n = 40) patients showed greater pulmonary impairment, as shown by reduced diffusing capacity (46% vs. 72.5% in LC p<0.001). Proteomic profiling revealed distinct immune and inflammatory signatures between groups. Applying a random forest classification algorithm, we were able to distinguish between the LC and the PC group with a high degree of accuracy of around 89%, using LAMP3 (Lysosome-associated membrane glycoprotein 3), CKAP4 (cytoskeleton associated protein 4) and KRT19 (Keratin 19).

This study introduces a novel characterization of patients recovering from severe COVID-19 without long-COVID, enabling clearer differentiation between persistent and recovering trajectories. Combining clinical data, pulmonary function, and proteomic machine learning analysis provides insight into post-acute COVID-19 biology and identifies candidate biomarkers for improved diagnosis.

## Linked entities

- **Genes:** LAMP3 (lysosome associated membrane protein 3) [NCBI Gene 27074], CKAP4 (cytoskeleton associated protein 4) [NCBI Gene 10970], KRT19 (keratin 19) [NCBI Gene 3880]
- **Diseases:** COVID-19 (MONDO:0100096)

## Full-text entities

- **Genes:** IL15RA (interleukin 15 receptor subunit alpha) [NCBI Gene 3601] {aka CD215}, IFNB1 (interferon beta 1) [NCBI Gene 3456] {aka IFB, IFF, IFN-beta, IFNB}, IFNA1 (interferon alpha 1) [NCBI Gene 3439] {aka IFL, IFN, IFN-ALPHA, IFN-alphaD, IFNA13, IFNA@}, LAMP3 (lysosome associated membrane protein 3) [NCBI Gene 27074] {aka CD208, DC LAMP, DC-LAMP, DCLAMP, LAMP, LAMP-3}, IL5 (interleukin 5) [NCBI Gene 3567] {aka EDF, IL-5, TRF}, IL7 (interleukin 7) [NCBI Gene 3574] {aka IL-7, IMD130}, CKAP4 (cytoskeleton associated protein 4) [NCBI Gene 10970] {aka CLIMP-63, CLIMP63, ERGIC-63, p63}, CD274 (CD274 molecule) [NCBI Gene 29126] {aka ADMIO5, B7-H, B7H1, PD-L1, PDCD1L1, PDCD1LG1}, CXCL5 (C-X-C motif chemokine ligand 5) [NCBI Gene 6374] {aka ENA-78, SCYB5}, CLEC4C (C-type lectin domain family 4 member C) [NCBI Gene 170482] {aka BDCA-2, BDCA2, CD303, CLECSF11, CLECSF7, DLEC}, CSF1 (colony stimulating factor 1) [NCBI Gene 1435] {aka CSF-1, MCSF, PG-M-CSF}, CLEC4A (C-type lectin domain family 4 member A) [NCBI Gene 50856] {aka CD367, CLECSF6, DCIR, DDB27, HDCGC13P, LLIR}, IL6 (interleukin 6) [NCBI Gene 3569] {aka BSF-2, BSF2, CDF, HGF, HSF, IFN-beta-2}, CRP (C-reactive protein) [NCBI Gene 1401] {aka PTX1}, IL17A (interleukin 17A) [NCBI Gene 3605] {aka CTLA-8, CTLA8, IL-17, IL-17A, IL17, ILA17}, IRF9 (interferon regulatory factor 9) [NCBI Gene 10379] {aka IRF-9, ISGF3, ISGF3G, p48}, MMP1 (matrix metallopeptidase 1) [NCBI Gene 4312] {aka CLG}, IL15 (interleukin 15) [NCBI Gene 3600] {aka IL-15}, IL10RB (interleukin 10 receptor subunit beta) [NCBI Gene 3588] {aka CDW210B, CRF2-4, CRFB4, D21S58, D21S66, IBD25}, CD83 (CD83 molecule) [NCBI Gene 9308] {aka BL11, HB15}, CCL2 (C-C motif chemokine ligand 2) [NCBI Gene 6347] {aka GDCF-2, HC11, HSMCR30, MCAF, MCP-1, MCP1}, IL17C (interleukin 17C) [NCBI Gene 27189] {aka CX2, IL-17C}, KRT19 (keratin 19) [NCBI Gene 3880] {aka CK19, K19, K1CS}, JUN (Jun proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 3725] {aka AP-1, AP1, c-Jun, cJUN, p39}, IGHE (immunoglobulin heavy constant epsilon) [NCBI Gene 3497] {aka IgE}, ITGA11 (integrin subunit alpha 11) [NCBI Gene 22801] {aka HsT18964}, PLXNA4 (plexin A4) [NCBI Gene 91584] {aka FAYV2820, PLEXA4, PLXNA4A, PLXNA4B, PRO34003}, ITGB6 (integrin subunit beta 6) [NCBI Gene 3694] {aka AI1H}, CXCL1 (C-X-C motif chemokine ligand 1) [NCBI Gene 2919] {aka FSP, GRO1, GROa, MGSA, MGSA-a, NAP-3}, IL10 (interleukin 10) [NCBI Gene 3586] {aka CSIF, GVHDS, IL-10, IL10A, TGIF}
- **Diseases:** pulmonary restriction (MESH:D002313), cognitive dysfunction (MESH:D003072), Post-severe COVID (MESH:D045169), COVID-19 long hauler (MESH:D000094024), organ damage (MESH:D000092124), impaired lung diffusion (MESH:D017563), cardiac involvement (MESH:D006331), pulmonary fibrosis (MESH:D011658), diffusion impairment (MESH:D008228), myalgia (MESH:D063806), neurocognitive (MESH:D019965), Infection (MESH:D007239), blood coagulation (MESH:D001778), respiratory syndromes (MESH:D012120), COVID-19 (MESH:D000086382), post (MESH:D000094025), cough (MESH:D003371), thrombosis (MESH:D013927), Symptom (MESH:D012816), respiratory symptoms (MESH:D012818), deaths (MESH:D003643), coronavirus (MESH:D018352), metabolic (MESH:D008659), chronic fatigue (MESH:D015673), neurological complaints (MESH:D009461), acute respiratory distress syndrome (MESH:D012128), fever (MESH:D005334), concentration difficulties (MESH:C567712), respiratory limitations (MESH:D012131), ventricular dilatation (MESH:C566255), COPD (MESH:D029424), fatigue (MESH:D005221), pneumonia (MESH:D011014), Dyspnea (MESH:D004417), lung disease (MESH:D008171), endothelial dysfunction (MESH:D014652), muscle weakness (MESH:D018908), pulmonary injury (MESH:D055370), Inflammation (MESH:D007249), fibrotic diseases (MESH:D004194), critical illness (MESH:D016638), exercise intolerance (MESH:C564972), PC (MESH:D015324), sleep disorders (MESH:D012893), Flu (MESH:D007251)
- **Chemicals:** Heparin (MESH:D006493), Clavulanic acid (MESH:D019818), Rivaroxaban (MESH:D000069552), Ceftriaxon (MESH:D002443), Piperacillin-Tazobactam (MESH:D000077725), 2Amoxicillin (-), Imdevimab (MESH:C000711488), oligonucleotide (MESH:D009841), Casirivimab (MESH:C000711487), nitric oxide (MESH:D009569), carbon monoxide (MESH:D002248), cortisone (MESH:D003348), vitamin D (MESH:D014807), Tocilizumab (MESH:C502936), oxygen (MESH:D010100)
- **Species:** Homo sapiens (human, species) [taxon 9606], Gammacoronavirus (genus) [taxon 694013], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12948100/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12948100/full.md

## References

60 references — full list in the complete paper: https://tomesphere.com/paper/PMC12948100/full.md

---
Source: https://tomesphere.com/paper/PMC12948100