# Personalized prediction of breast cancer candidates for Anti-HER2 therapy using 18F-FDG PET/CT parameters and machine learning: a dual-center study

**Authors:** Zhenguo Sun, Jianxiong Gao, Wenji Yu, Xiaoshuai Yuan, Peng Du, Peng Chen, Yuetao Wang

PMC · DOI: 10.3389/fonc.2025.1590769 · Frontiers in Oncology · 2025-05-14

## TL;DR

This study uses machine learning and PET/CT data to predict HER2 expression levels in breast cancer patients, helping personalize treatment decisions.

## Contribution

A machine learning model using PET/CT and clinicopathological features is proposed to distinguish HER2 expression levels in breast cancer.

## Key findings

- XGBoost models achieved AUCs of 0.888, 0.844, and 0.759 for predicting HER2-zero vs. HER2-low/positive expression.
- For HER2-low vs. HER2-positive expression, XGBoost achieved AUCs of 0.920, 0.814, and 0.693.
- Key features included tumor size, SUVmean, PR status, and metabolic tumor volume.

## Abstract

Accurately evaluating human epidermal growth factor receptor (HER2) expression status in breast cancer enables clinicians to develop individualized treatment plans and improve patient prognosis. The purpose of this study was to assess the performance of a machine learning (ML) model that was developed using 18F-FDG PET/CT parameters and clinicopathological features in distinguishing different levels of HER2 expression in breast cancer.

This retrospective study enrolled breast cancer patients who underwent 18F-FDG PET/CT scans prior to treatment at Lianyungang First People’s Hospital (centre 1, n=157) and the Third Affiliated Hospital of Soochow University (centre 2, n=84). Two classification tasks were analysed: distinguishing HER2-zero expression from HER2-low/positive expression (Task 1) and distinguishing HER2-low expression from HER2-positive expression (Task 2). For each task, patients from Centre 1 were randomly divided into training and internal test sets at a 7:3 ratio, whereas patients from Centre 2 served as an external test set. The prediction models included logistic regression (LR), support vector machine (SVM), extreme gradient boosting (XGBoost) and multilayer perceptron (MLP), and SHAP analysis provided model interpretability. Model performance was evaluated via the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV).

XGBoost models exhibited the best predictive performance in both tasks. For Task 1, recursive feature elimination (RFE) was used to select 8 features, excluding pathological features, and the XGBoost model achieved AUCs of 0.888, 0.844 and 0.759 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the tumour minimum diameter, mean standardized uptake value (SUVmean) and CTmean. For Task 2, 9 features were selected, including progesterone receptor (PR) status as a pathological feature. The XGBoost model achieved AUCs of 0.920, 0.814 and 0.693 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the PR status, maximum tumour diameter and metabolic tumour volume (MTV).

ML models that incorporate 18F-FDG PET/CT parameters and clinicopathological features can aid in the prediction of different HER2 expression statuses in breast cancer.

## Linked entities

- **Proteins:** ERBB2 (erb-b2 receptor tyrosine kinase 2)
- **Chemicals:** 18F-FDG (PubChem CID 68614)
- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** PGR (progesterone receptor) [NCBI Gene 5241] {aka NR3C3, PR}, ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, ITIH2 (inter-alpha-trypsin inhibitor heavy chain 2) [NCBI Gene 3698] {aka H2P, ITI-HC2, SHAP}, EGFR (epidermal growth factor receptor) [NCBI Gene 1956] {aka ERBB, ERBB1, ERRP, HER1, NISBD2, NNCIS}
- **Diseases:** tumour (MESH:D009369), breast cancer (MESH:D001943)
- **Chemicals:** F-FDG (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12116446/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12116446/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/PMC12116446/full.md

---
Source: https://tomesphere.com/paper/PMC12116446