# Utilizing Immuno-Oncology registry data for enhanced non-small cell lung cancer treatment predictions

**Authors:** Yili Zhang, Shaked Lev-Ari, Jacob Zaemes, Alexandra Della Pia, Bianca DeAgresta, Samir Gupta, Alex Marki, Rachel Zemel, Andrew Ip, Adil Alaoui, Charalampos Charalampous, Iris Rahman, Olivia Wilkins, Subha Madhavan, Peter McGarvey, Lauren Pascual, Michael B Atkins, Neil J Shah

PMC · DOI: 10.1093/jamiaopen/ooaf069 · JAMIA Open · 2025-07-09

## TL;DR

This study uses machine learning and clinical data to better predict how non-small cell lung cancer patients will respond to immunotherapy.

## Contribution

A machine learning model combining clinical and biochemical data outperforms PD-L1 in predicting immunotherapy response in NSCLC.

## Key findings

- The KNN model achieved an AUROC of 0.862, surpassing PD-L1's 0.619 in predicting treatment response.
- Key predictive features included ECOG performance status, red cell distribution width, and mean platelet volume.
- Combining multiple biomarkers improves prediction accuracy over single biomarkers like PD-L1.

## Abstract

We aim to leverage more comprehensive phenotypic and genotypic clinical data to enhance the treatment response predictions.

The study cohort includes 213 NSCLC patients who underwent ICI therapy. Patients were categorized based on treatment outcomes: those with complete or partial responses were considered responders, while those exhibiting stable or progressive disease were deemed non-responders. Comprehensive phenotypic and genomic features were selected for prediction. We developed 9 machine learning models. The model demonstrating the highest area under the receiver operating characteristic curve (AUROC) performance was further analyzed using Shapley additive explanation values to interpret the predictive factors.

There were 72 patients who responded to the treatment, while 141 patients were considered non-responders. In total, 57 features were included, encompassing demographics, tumor status, treatment information, pre-treatment information, serum CBC, serum chemistry, and vital signs. The KNN model excelled among the models, achieving an AUROC score of 0.862 and outperforming the conventional PD-L1 biomarker’s AUROC of 0.619. The top features influencing ICI treatment response include the ECOG performance status of 0, lower red cell distribution width, higher mean platelet volume, etc.

The significance of functional status, inflammatory biomarkers, and PD-L1 expression are revealed. This research underscores the potential of using a more nuanced combination of biochemical markers and clinical data to enhance the precision of immunotherapy efficacy predictions, compared with single prognostic biomarkers such as PD-L1.

Our findings emphasize the complex interplay among various risk factors that influence the effectiveness of ICI.

## Linked entities

- **Diseases:** non-small cell lung cancer (MONDO:0005233), NSCLC (MONDO:0005233)

## Full-text entities

- **Genes:** CD274 (CD274 molecule) [NCBI Gene 29126] {aka ADMIO5, B7-H, B7H1, PD-L1, PDCD1L1, PDCD1LG1}
- **Diseases:** inflammatory (MESH:D007249), non-small cell lung cancer (MESH:D002289), tumor (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12239864/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12239864/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC12239864/full.md

---
Source: https://tomesphere.com/paper/PMC12239864