# A machine learning–based triage system for systemic EBV-positive T/NK cell lymphoproliferative diseases of childhood

**Authors:** Pujun Guan, Zihang Chen, Hanze Dong, Xia Guo, Juan Huang, Tian Dong, Mi Wang, Xiaoxi Lu, Fei Huang, Wenbin Li, Yuan Tang, Li Zhang, Ling Pan, Ju Gao, Shikun Wang, Rongbo Liu, Wenyan Zhang, Sha Zhao, Weiping Liu

PMC · DOI: 10.1172/jci.insight.180837 · JCI Insight · 2026-02-23

## TL;DR

A machine learning system called COLLAPSED helps doctors quickly identify high-risk children with a rare EBV-related disease, enabling faster treatment decisions.

## Contribution

The COLLAPSED system is the first machine learning-based triage tool developed and validated for sEBV+T/NK-LPD using small-data-friendly techniques.

## Key findings

- COLLAPSED identifies critical factors for disease severity in sEBV+T/NK-LPD using machine learning.
- The system was validated with 156 patients and further tested with additional cases from literature and transplanted patients.
- The model was simplified into a risk score to improve clinical interpretability and accessibility.

## Abstract

Systemic Epstein-Barr virus–positive (EBV-positive) T/NK cell lymphoproliferative diseases of childhood (sEBV+T/NK-LPD) are a spectrum of rare diseases that have highly variable biological behavior, from indolent conditions to highly aggressive malignancies. Clinicians currently face substantial challenges in promptly assessing disease severity and predicting patient outcomes, leading to limitations in treatment planning. To address this challenge, we constructed a comprehensive triage system to aid in rapid clinical interventions. The study included 156 patients with newly diagnosed sEBV+T/NK-LPD from 42 institutions. An independent prospective cohort of 35 newly enrolled patients was further included to evaluate the model’s performance. An additional 45 patients from the literature and 18 patients who underwent hematopoietic stem cell transplantation were included to test the score’s generalizability. An integrative machine learning strategy was applied to identify robust and optimal factors and to integrate multiple algorithms to enhance the system’s performance and stability. This system, termed COLLAPSED, identifies critical factors and provides a stable, high-performing ensemble. This model was validated externally and simplified into a risk score to improve interpretability and accessibility. The COLLAPSED system substantially enhances clinicians’ ability to rapidly and precisely identify high-risk patients, thus enabling timely clinical decision-making and expedited initiation of potentially lifesaving treatments.

The COLLAPSED system is the first ML-based triage system that was developed and validated for sEBV+T/NK-LPD through small-data-friendly machine-learning techniques.

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, IL2RA (interleukin 2 receptor subunit alpha) [NCBI Gene 3559] {aka CD25, IDDM10, IL2R, IMD41, TCGFR, p55}, ALPP (alkaline phosphatase, placental) [NCBI Gene 250] {aka ALP, PALP, PLAP, PLAP-1}, SLC17A5 (solute carrier family 17 member 5) [NCBI Gene 26503] {aka AST, ISSD, NSD, SD, SIALIN, SIASD}, PDCD1 (programmed cell death 1) [NCBI Gene 5133] {aka ADMIO4, AIMTBS, CD279, PD-1, PD1, SLEB2}, ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}
- **Diseases:** Lymphoma (MESH:D008223), lymphadenopathies (MESH:D008206), Lymphoproliferative Diseases (MESH:D008232), T/NK cell lymphoproliferative diseases (MESH:D054391), liver dysfunction (MESH:D017093), systemic (MESH:D015619), NK-S (MESH:D018455), sHV-LPD (MESH:D006837), CAEBVD (MESH:D006521), bulky disease (MESH:D004194), CAEBVD-T (MESH:D020031), EBV-positive T cell lymphoma of childhood (MESH:D016399), death (MESH:D003643), hemophagocytic lymphohistiocytosis (MESH:D051359), rare diseases (MESH:D035583), hematological diseases (MESH:D006402), EBV-positive) T/NK cell lymphoproliferative diseases (MESH:C563822), effusion (MESH:D000080324), malignancies (MESH:D009369), lymphocytopenia (MESH:D008231), /NK-LPD (MESH:D054066)
- **Chemicals:** ED (MESH:D004540), L (MESH:D007930), CHOP (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12955998/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12955998/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/PMC12955998/full.md

---
Source: https://tomesphere.com/paper/PMC12955998