# Integrating Gene Expression With Recurrent Mutations Improves Age‐Stratified Risk Prediction in Acute Myeloid Leukemia

**Authors:** Mobina Shrestha, Salina Dahal, Asis Shrestha, Patricia McNally, Amir Babu Shrestha, Niklas Mackler

PMC · DOI: 10.1002/jha2.70261 · 2026-03-11

## TL;DR

Adding gene expression data to known mutations improves predicting outcomes in AML, especially for older patients.

## Contribution

Combining gene expression with mutations enhances age-stratified risk prediction in AML.

## Key findings

- Adding gene expression improved survival and remission prediction models for AML patients.
- TP53 was the strongest adverse marker for survival, while NPM1 improved remission chances.
- CHEK2 and CCNG1 were key genes influencing outcomes, particularly in older patients.

## Abstract

Older adults with acute myeloid leukemia (AML) have inferior outcomes, yet current genetic risk models do not explicitly account for how age modifies the prognostic impact of molecular features. We hypothesized that integrating apoptosis and p53‐related gene expression with recurrent mutations would improve prediction of complete remission (CR) and 2‐year overall survival (OS), particularly across age groups.

Using the BeatAML2 dataset (805 patients; 942 specimens), we built two cohorts: a clinical cohort of 916 adults with full data and an expression‐linked cohort of 852 with matched RNA‐seq. Patients were divided into four age groups 18–30, 30–45, 45–60, and 60+ years. We tested whether adding expression of 12 apoptosis and p53‐related genes to five well‐known mutations, that is TP53, NPM1, FLT3, RUNX1, and ASXL1, could improve the prediction of CR and 2‐year OS.

Adding gene expression improved predictive performance across models. For 2‐year OS, AUCs rose from 0.765 to 0.772 in XGBoost, 0.703 to 0.843 in Random Forest, and 0.697 to 0.721 in Logistic Regression. For CR, performance improved from 0.770 to 0.851 in XGBoost, 0.811 to 0.861 in Random Forest, and 0.731 to 0.696 in Logistic Regression. Calibration was strongest for tree‐based models, and reclassification improved most with XGBoost. Multivariable regression confirmed TP53 as the most adverse marker for OS (HR: 3.07), with added risk from ASXL1 (HR: 1.53) and FLT3 (HR: 1.39). NPM1 increased the chance of remission (OR: 2.47) but did not extend survival. SHAP analysis showed that age remained the leading predictor of OS. Among genes, CHEK2 expression was most important for survival, especially in patients 60 years and older, while CCNG1 expression best predicted remission, along with BAX and MCL1.

These results demonstrate that combining gene expression with recurrent mutations makes risk prediction more accurate, especially in older patients who formed the largest group and had the poorest outcomes. Although treatment variables were not included and analysis focused on selected genes, these findings support incorporation of expression‐based features into genetic risk models and warrant prospective validation.

The authors have confirmed clinical trial registration is not needed for this submission

## Linked entities

- **Genes:** TP53 (tumor protein p53) [NCBI Gene 7157], NPM1 (nucleophosmin 1) [NCBI Gene 4869], FLT3 (fms related receptor tyrosine kinase 3) [NCBI Gene 2322], RUNX1 (RUNX family transcription factor 1) [NCBI Gene 861], ASXL1 (ASXL transcriptional regulator 1) [NCBI Gene 171023], CHEK2 (checkpoint kinase 2) [NCBI Gene 11200], CCNG1 (cyclin G1) [NCBI Gene 900], BAX (BCL2 associated X, apoptosis regulator) [NCBI Gene 581], MCL1 (MCL1 apoptosis regulator, BCL2 family member) [NCBI Gene 4170]
- **Diseases:** acute myeloid leukemia (MONDO:0015667), AML (MONDO:0018874)

## Full-text entities

- **Genes:** NPM1 (nucleophosmin 1) [NCBI Gene 4869] {aka B23, NPM}, MCL1 (MCL1 apoptosis regulator, BCL2 family member) [NCBI Gene 4170] {aka BCL2L3, EAT, MCL1-ES, MCL1L, MCL1S, Mcl-1}, ASXL1 (ASXL transcriptional regulator 1) [NCBI Gene 171023] {aka BOPS, MDS}, CCNG1 (cyclin G1) [NCBI Gene 900] {aka CCNG}, RUNX1 (RUNX family transcription factor 1) [NCBI Gene 861] {aka AML1, AML1-EVI-1, AMLCR1, CBF2alpha, CBFA2, EVI-1}, CHEK2 (checkpoint kinase 2) [NCBI Gene 11200] {aka CDS1, CHK2, HuCds1, LFS2, PP1425, RAD53}, FLT3 (fms related receptor tyrosine kinase 3) [NCBI Gene 2322] {aka CD135, FLK-2, FLK2, STK1}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, BAX (BCL2 associated X, apoptosis regulator) [NCBI Gene 581] {aka BCL2L4}
- **Diseases:** AML (MESH:D015470)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12976655/full.md

---
Source: https://tomesphere.com/paper/PMC12976655