# Molecular Classification of Patients With COVID‐19 Based on Transcriptional Profiling

**Authors:** Hongyu Liu, Ying Zheng, Xiaoyan Deng, Mengxue Li, Di He, Wenting Zuo, Yitian Xu, Xuhui Shen, Haibo Li, Bin Cao

PMC · DOI: 10.1111/irv.70227 · Influenza and Other Respiratory Viruses · 2026-02-10

## TL;DR

The study identifies three distinct molecular profiles in COVID-19 patients using gene expression data, offering potential biomarkers for personalized treatment strategies.

## Contribution

A novel molecular classification system for COVID-19 patients based on transcriptional profiling and robust biomarker identification.

## Key findings

- Three endotypes with distinct biological and clinical profiles were identified using transcriptomic data.
- Endotype-specific biomarker pairs enable accurate classification and are measurable via RT-qPCR in peripheral blood.
- The classification system reveals heterogeneity in immune responses and could guide precision therapies.

## Abstract

COVID‐19 has caused over 7 million deaths worldwide and remains a critical public health threat. The marked heterogeneity in immune responses among patients poses challenges for targeted treatment. Molecular classification is essential for guiding precision therapies.

We performed unsupervised consensus clustering on blood transcriptomic data from 351 COVID‐19 patients to identify molecular endotypes and validated the classification in an independent cohort of 56 patients. To identify robust endotype‐specific biomarkers, we applied XGBoost, LASSO, and random forest algorithms.

Three endotypes with distinct biological and clinical profiles were identified. Endotype 1, associated with favorable outcomes, showed enriched DNA replication pathways and elevated IL7 expression. Endotype 2 featured hypoxia and angiotensin‐related pathways. Endotype 3 exhibited TLR4 activation, IL‐1β upregulation, and impaired NK cytotoxicity, correlating with poor outcomes. All endotypes shared type I interferon activation. Predictive biomarker pairs included STAT4:S100A11 (endotype 1), SLC4A1:RPL31 (endotype 2), and RALB:MTR (endotype 3), enabling endotype classification with high accuracy. Importantly, these biomarker genes can be reliably measured in peripheral blood using RT‐qPCR, making the classification model feasible for clinical application.

This molecular classification reveals heterogeneity in COVID‐19 and proposes biomarker‐guided strategies for patient stratification and management.

## Linked entities

- **Genes:** IL7 (interleukin 7) [NCBI Gene 3574], STAT4 (signal transducer and activator of transcription 4) [NCBI Gene 6775], S100A11 (S100 calcium binding protein A11) [NCBI Gene 6282], SLC4A1 (solute carrier family 4 member 1 (Diego blood group)) [NCBI Gene 6521], RPL31 (ribosomal protein L31) [NCBI Gene 6160], RALB (RAS like proto-oncogene B) [NCBI Gene 5899], MTR (5-methyltetrahydrofolate-homocysteine methyltransferase) [NCBI Gene 4548], IL1B (interleukin 1 beta) [NCBI Gene 3553], TLR4 (toll like receptor 4) [NCBI Gene 7099]
- **Diseases:** COVID-19 (MONDO:0100096)

## Full-text entities

- **Genes:** RALB (RAS like proto-oncogene B) [NCBI Gene 5899], S100A11 (S100 calcium binding protein A11) [NCBI Gene 6282] {aka HEL-S-43, MLN70, S100C}, IL1B (interleukin 1 beta) [NCBI Gene 3553] {aka IL-1, IL1-BETA, IL1F2, IL1beta}, STAT4 (signal transducer and activator of transcription 4) [NCBI Gene 6775] {aka DPMC, SLEB11}, IL7 (interleukin 7) [NCBI Gene 3574] {aka IL-7, IMD130}, RPL31 (ribosomal protein L31) [NCBI Gene 6160] {aka L31, eL31}, TLR4 (toll like receptor 4) [NCBI Gene 7099] {aka ARMD10, CD284, TLR-4, TOLL}, SLC4A1 (solute carrier family 4 member 1 (Diego blood group)) [NCBI Gene 6521] {aka AE1, BND3, CD233, CHC, DI, EMPB3}
- **Diseases:** COVID-19 (MESH:D000086382), hypoxia (MESH:D000860), deaths (MESH:D003643)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12887440/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12887440/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC12887440/full.md

---
Source: https://tomesphere.com/paper/PMC12887440