# The Efficacy of Electronic Health Record-Based Artificial Intelligence Models for Early Detection of Pancreatic Cancer: A Systematic Review and Meta-Analysis

**Authors:** George G. Makiev, Igor V. Samoylenko, Valeria V. Nazarova, Zahra R. Magomedova, Alexey A. Tryakin, Tigran G. Gevorkyan

PMC · DOI: 10.3390/cancers18020315 · Cancers · 2026-01-20

## TL;DR

AI models using electronic health records can help detect pancreatic cancer early, but challenges like false positives and low accuracy in some cases remain.

## Contribution

This study is the first systematic review and meta-analysis evaluating AI models for early pancreatic cancer detection using only structured EHR data.

## Key findings

- AI models achieved a pooled AUC of 0.785, showing good overall discriminatory ability for pancreatic cancer prediction.
- Neural networks outperformed other models in AUC, but had lower sensitivity and specificity compared to Light Gradient Boosting models.
- Positive predictive values were consistently low (<1%), highlighting the difficulty of screening for a rare disease.

## Abstract

Pancreatic cancer is often detected too late, leading to very low survival rates. Screening everyone is not practical due to the disease’s rarity and high costs. This study explores a new approach: using artificial intelligence (AI) to analyze patients’ existing electronic health records—like doctor’s visit notes and lab results—to identify those at high risk of pancreatic cancer long before symptoms appear. By systematically reviewing existing research, we aimed to determine how accurate these AI tools are. Our findings show they hold significant promise for early detection, which could allow doctors to monitor high-risk patients more closely and ultimately save lives by catching the cancer at a treatable stage. However, challenges such as the potential for false-positive results and the need for further validation in diverse clinical settings must be addressed before its widespread use in clinical practice.

Background: The persistently low 5-year survival rate for pancreatic cancer (PC) underscores the critical need for early detection. However, population-wide screening remains impractical. Artificial Intelligence (AI) models using electronic health record (EHR) data offer a promising avenue for pre-symptomatic risk stratification. Objective: To systematically review and meta-analyze the performance of AI models for PC prediction based exclusively on structured EHR data. Methods: We systematically searched PubMed, MedRxiv, BioRxiv, and Google Scholar (2010–2025). Inclusion criteria encompassed studies using EHR-derived data (excluding imaging/genomics), applying AI for PC prediction, reporting AUC, and including a non-cancer cohort. Two reviewers independently extracted data. Random-effects meta-analysis was performed for AUC, sensitivity (Se), and specificity (Sp) using R software version 4.5.1. Heterogeneity was assessed using I2 statistics and publication bias was evaluated. Results: Of 946 screened records, 19 studies met the inclusion criteria. The pooled AUC across all models was 0.785 (95% CI: 0.759–0.810), indicating good overall discriminatory ability. Neural Network (NN) models demonstrated a statistically significantly higher pooled AUC (0.826) compared to Logistic Regression (LogReg, 0.799), Random Forests (RF, 0.762), and XGBoost (XGB, 0.779) (all p < 0.001). In analyses with sufficient data, models like Light Gradient Boosting (LGB) showed superior Se and Sp (99% and 98.7%, respectively) compared to NNs and LogReg, though based on limited studies. Meta-analysis of Se and Sp revealed extreme heterogeneity (I2 ≥ 99.9%), and the positive predictive values (PPVs) reported across studies were consistently low (often < 1%), reflecting the challenge of screening a low-prevalence disease. Conclusions: AI models using EHR data show significant promise for early PC detection, with NNs achieving the highest pooled AUC. However, high heterogeneity and typically low PPV highlight the need for standardized methodologies and a targeted risk-stratification approach rather than general population screening. Future prospective validation and integration into clinical decision-support systems are essential.

## Linked entities

- **Diseases:** pancreatic cancer (MONDO:0005192)

## Full-text entities

- **Diseases:** cancer (MESH:D009369), PC (MESH:D010190)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12838961/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12838961/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC12838961/full.md

---
Source: https://tomesphere.com/paper/PMC12838961