# Ancestry-Associated Performance Variability of Open-Source AI Models for EGFR Prediction in Lung Cancer

**Authors:** Mehrdad Rakaee, Amin H. Nassar, Masoud Tafavvoghi, Falah Jabar, Elias Bou Farhat, Elio Adib, Sigve Andersen, Lill-Tove Rasmussen Busund, Mette Pøhl, Åslaug Helland, Alexander Gusev, Biagio Ricciuti, Lynette M. Sholl, Tom Donnem, David J. Kwiatkowski

PMC · DOI: 10.1001/jamaoncol.2025.6430 · 2026-02-12

## TL;DR

This study shows that open-source AI models for predicting EGFR mutations in lung cancer work well overall but have lower accuracy for Asian patients and pleural tissue samples.

## Contribution

The study reveals ancestry-related performance variability in AI models for EGFR prediction, highlighting the need for recalibration in diverse populations.

## Key findings

- AI models achieved high accuracy for EGFR prediction but showed lower performance in Asian ancestry subgroups.
- Performance declined in pleural tissue samples compared to lung specimens.
- AI triage could reduce rapid EGFR testing by 57% while maintaining high sensitivity and specificity.

## Abstract

Do open-source artificial intelligence (AI) models for predicting EGFR mutations from pathology slides perform consistently across patient populations and clinical settings?

In this multicohort study of 2098 patients with lung adenocarcinoma from the US and Europe, open-source AI approaches achieved high accuracy for EGFR prediction and demonstrated overall robust performance. Subgroup analyses revealed lower accuracy in Asian patients and pleural tissue samples.

AI-based histology tools show strong potential as rapid, low-cost adjuncts for identifying EGFR mutations; broader validation and recalibration across diverse populations and tissue types will help ensure equitable clinical adoption and maximize their impact in cancer care.

This cohort study evaluated the performance and generalizability of 2 open-source artifical intelligence models in predicting mutations in EGFR genes in lung adenocarcinomas using data from 1 US and 1 European cohort.

Artificial intelligence (AI) models are emerging as rapid, low-cost tools for predicting targetable genomic alterations directly from routine pathology slides. Although these approaches could accelerate treatment decisions in lung cancer, little is known about whether their performance is consistent across diverse patient populations and tissue contexts.

To evaluate the performance and generalizability of 2 open-source AI pathology models for predicting EGFR mutation status in lung adenocarcinoma (LUAD) across independent cohorts and ancestral subgroups.

This cohort study included patients with LUAD from 2 cohorts: Dana-Farber Cancer Institute (DFCI) from June 2013 to November 2023, and a European-based trial (TNM-I) from August 2016 to February 2022. All patients had paired next-generation sequencing data and hematoxylin-eosin–stained whole-slide images. In the DFCI cohort, genetic ancestry was inferred using germline genotype data. Data analyses were performed from July 2025 to September 2025.

The primary outcome was model performance for predicting EGFR mutation status, measured as the area under the receiver operating characteristic curve (AUC), evaluated overall and across ancestry subgroups and sample types.

Overall, 2098 patients with LUAD were included (mean [SD] age, 66.6 [10.3] years; 1315 female individuals [63%] and 783 male individuals [37%]). In the DFCI cohort (n = 1759; 54 African, 101 American, 95 Asian, 1465 European), EGFR mutations were detected in 432 patients (25%). One AI-pathology model achieved an AUC of 0.83 (95% CI, 0.81-0.85) compared with 0.68 (95% CI, 0.65-0.70) for the other model. In the TNM-I cohort (n = 339), EGFR mutations were detected in 50 patients (15%), with AUCs of 0.81 (95% CI, 0.74-0.88) and 0.75 (95% CI, 0.68-0.83), respectively. In ancestry-stratified analyses of the DFCI cohort, AUCs for the higher-performing model were 0.84 (95% CI, 0.81-0.86) in patients of European ancestry, 0.85 (95% CI, 0.72-0.94) in African ancestry, and 0.68 (95% CI, 0.55-0.78) in Asian ancestry. In sample type analyses, performance declined in pleural (AUC, 0.66; 95% CI, 0.56-0.76) compared with lung specimens (AUC, 0.86; 95% CI, 0.83-0.88). AI-guided triage analyses showed a potential 57% reduction in rapid EGFR testing, while maintaining sensitivity of 0.84 and specificity of 0.99.

This cohort study found that AI-based pathology tools may serve as preliminary adjuncts for EGFR prediction in lung cancer, though performance differences by ancestry warrant careful interpretation.

## Linked entities

- **Genes:** EGFR (epidermal growth factor receptor) [NCBI Gene 1956]
- **Diseases:** lung adenocarcinoma (MONDO:0005061), lung cancer (MONDO:0005138)

## Full-text entities

- **Genes:** EGFR (epidermal growth factor receptor) [NCBI Gene 1956] {aka ERBB, ERBB1, ERRP, HER1, NISBD2, NNCIS}
- **Diseases:** LUAD (MESH:D000077192), Cancer (MESH:D009369), Lung Cancer (MESH:D008175), TNM-I (MESH:D006969)
- **Chemicals:** hematoxylin (MESH:D006416), eosin (MESH:D004801)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12902924/full.md

---
Source: https://tomesphere.com/paper/PMC12902924