SHAP-explained machine-learning model for high-risk gastric cancer identification
Hyun Jin Oh, Chung Ho Kim, Jae Kwan Jun, Mina Suh, Kui Son Choi, Il Ju Choi, Bomi Park

TL;DR
This study developed a machine learning model to predict high-risk gastric cancer using factors like age, H. pylori infection, and endoscopic findings, with SHAP explaining the model's predictions.
Contribution
A novel SHAP-explained machine learning model for gastric cancer risk prediction incorporating endoscopic AG/IM and regional factors.
Findings
XGBoost outperformed other models with an AUROC of 0.764 in internal validation.
SHAP analysis identified H. pylori, age, sex, smoking, and AG/IM as key risk factors.
The model showed practical potential for risk-adapted gastric cancer screening workflows.
Abstract
Gastric cancer (GC) remains a major public health concern in Asia. Risk prediction tailored to regional biological features such as Helicobacter pylori (H. pylori) status and high-risk mucosal findings such as atrophic gastritis (AG) and intestinal metaplasia (IM) may help improve the screening workflow. Using a large, real-world, nationwide screening cohort with available endoscopic AG/IM codes, we developed 2-year GC risk prediction models that integrate AG/IM with regional demographic and lifestyle factors. We compared a conventional Cox proportional hazards model (CPHM) with the following machine learning (ML) approaches: extreme gradient boosting (XGBoost), decision tree (DT), and logistic regression (LR). Discrimination and calibration were evaluated through internal and external validations. Model interpretability was assessed using Shapley Additive Explanations (SHAP). The…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHelicobacter pylori-related gastroenterology studies · Gastric Cancer Management and Outcomes · Colorectal Cancer Screening and Detection
