# International testing and refinement of AI algorithms predicting acute leukemia subtypes from routine laboratory data

**Authors:** Amin T. Turki, Yi Fan, Alberto Hernández-Sánchez, Wellington Silva, Shaun Fleming, Koray Yalcin, Catharina H.M.J. Van Elssen, Yazan Madanat, Magdalena Karasek, Mahmoud Aljurf, Matteo G. Della Porta, Alexandra Martinez-Roca, Luca Guarnera, Katarina Steffen, Evangelia Antoniou, Maria M. Rivas, Deepak K. Mishra, Ansgar T. Blum, Stephania Niry Manantsoa, Adeniyi Adiat, Amir Enshaei, Felicitas Thol, Maria Teresa Voso, Jia Chen, Tusneem Ahmed Elhassan, Anthony V. Moorman, María Belén Vidriales, Nina R. Neuendorff, Ahmet Koc, Pratyush Mishra, Dirk Strumberg, Roma S. Fourmanov, Lukas Heine, Jens Kleesiek, Daniel Munárriz, Gianluca Asti, Mridula Mokoonlall, Marisa Kometas, Eduardo Rego, Rabea Mecklenbrauck, Marta Sobas, Depei Wu, Felix Nensa, Merlin Engelke

PMC · DOI: 10.1038/s41467-026-70584-z · Nature Communications · 2026-03-20

## TL;DR

This study tests an AI tool for diagnosing acute leukemia using routine lab data from a diverse international patient group to reduce health disparities.

## Contribution

The study introduces an ensemble AI model that improves diagnostic accuracy while maintaining generalizability across diverse populations.

## Key findings

- The AI algorithm achieved high AUROC metrics for specific leukemia subtypes, such as 0.98 for promyelocytic leukemia.
- An ensemble of Isolation Forest and Local Outlier Factor improved AML AUROC from 0.72 to 0.84 in the hold-out test set.
- The model was retrained to better suit pediatric patients, addressing differences in leukemia diagnosis between adults and children.

## Abstract

Despite advances for patients with acute leukemia health disparities limit access to diagnosis and treatment. Artificial Intelligence (AI) approaches may address some disparities. We retrospectively assemble a diverse, international cohort of 6206 leukemia patients from 20 centers to test an AI tool designed to support leukemia diagnosis using standard laboratory results. Executing the pretrained algorithm results in varying accuracy metrics. With confidence cutoff predictions, 2000-fold bootstrapped area under the curve (AUROC) metrics are 0.94 for acute myeloid leukemia (AML), 0.98 for the promyelocytic subtype and 0.84 for acute lymphoblastic leukemia. However, this cutoff excludes 70.8–92.5% of patients from predictions. We improve accuracy and robustness, while maintaining generalizability via an ensemble of Isolation Forest and Local Outlier Factor increasing AUROC for AML from 0.72 to 0.84 (hold-out test set, patients below confidence threshold), while excluding only 12.1% of patients. Furthermore, we retrain the algorithm for pediatric patients.

Artificial intelligence (AI) could help improve risk predictions in acute leukaemia and reduce systemic health disparities in the diagnostic process. Here, the authors assemble a diverse, international cohort of 6,206 patients with acute leukaemias and deploy an AI tool to support diagnosis based on standard laboratory results, with refinements for both adult and peadiatric leukaemias.

## Linked entities

- **Diseases:** acute leukemia (MONDO:0010643), acute myeloid leukemia (MONDO:0015667), promyelocytic leukemia (MONDO:0012883), acute lymphoblastic leukemia (MONDO:0004967)

## Full-text entities

- **Genes:** FGB (fibrinogen beta chain) [NCBI Gene 2244] {aka HEL-S-78p}, ELN (elastin) [NCBI Gene 2006] {aka ADCL1, SVAS, WBS, WS}, SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}
- **Diseases:** cancer (MESH:D009369), acute leukaemia (MESH:D054218), coagulation (MESH:D001778), LDH (MESH:C538133), impaired coagulation (MESH:D025861), APL (MESH:D015473), ALL (MESH:D054198), leukaemias (MESH:D015458), thromboembolic (MESH:D013923), AML (MESH:D015470), Leukemia (MESH:D007938), death (MESH:D003643), AI (MESH:C538142), bleeding (MESH:D006470), leukocytosis (MESH:D007964), cytopenia (MESH:D006402), leukopenia (MESH:D007970), adult (MESH:C538052), PT (MESH:D006526)
- **Chemicals:** ATRA (MESH:D014212), cumarin (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13004982/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13004982/full.md

## References

8 references — full list in the complete paper: https://tomesphere.com/paper/PMC13004982/full.md

---
Source: https://tomesphere.com/paper/PMC13004982