# An exploratory study on predicting HER2-positive expression status of breast cancer using ultrasound radiomics combined with machine learning models

**Authors:** Xin-Ran Zhang, Sha-Sha Yuan, Jiao-Jiao Hu, Qing-Qing Chen, Yang-Jie Xiao, Ying-Fei Huang, Xiao-Qing Yu, Feng Lu, Yan Shen, Xiao-Hong Fu

PMC · DOI: 10.1371/journal.pone.0334909 · PLOS One · 2025-10-23

## TL;DR

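This exploratory study combines radiomics features from conventional ultrasound images with machine learning to predict HER2-positive status in breast cancer, reaching an AUC of 0.893 in the training and validation dataset and 0.854 in external validation.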

## Contribution

The study develops a machine learning model that predicts HER2-positive status in breast cancer from ultrasound radiomics features, built on 437 patients and tested on an external cohort of 88 patients from other hospitals.

## Key findings

- Five radiomics features were identified as correlated with HER2-positive status in breast cancer.
- The best model achieved an AUC of 0.893 in training/validation and 0.854 in external validation.
- The model shows potential clinical value for predicting HER2 status using ultrasound imaging.

## Abstract

This study aimed to investigate the feasibility and potential value of predictive models for human epidermal growth factor receptor 2 (HER2)-positive status in breast cancer (BC) based on radiomics features from conventional ultrasound images and machine learning models.

Ultrasound images of 437 patients with surgically and pathologically confirmed BC (144 HER2-positive and 293 HER2-negative cases) were retrospectively analyzed and used as the training and validation dataset. Key features highly correlated with HER2-positive status were identified and selected using the least absolute shrinkage and selection operator (LASSO), t-test, and principal component analysis (PCA). After feature selection, the dataset was randomly split into five approximately equal parts for five-fold cross-validation to identify the optimal machine learning method and hyperparameters. A predictive model was then developed based on ultrasound imaging and radiomics features. After feature selection and model development, an additional cohort of 88 patients from other hospitals was used as an external validation dataset. The model’s internal validation performance was assessed through receiver operating characteristic (ROC) curve analysis, and metrics including area under the curve (AUC), sensitivity, and specificity were calculated. The generalizability of the model was further evaluated on the external validation dataset.
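The cross-validation split and the evaluation metrics described above can be sketched in plain Python. This is a minimal illustration, not the authors' code: the function names (`five_fold_indices`, `auc`, `sensitivity_specificity`), the fixed seed, and the 0.5 decision threshold are assumptions made for the example, since the abstract does not state which software or threshold was used.

```python
import random

def five_fold_indices(n_samples, seed=0):
    """Shuffle sample indices and deal them into five near-equal folds."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    return [idx[k::5] for k in range(5)]

def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs in which the
    positive case scores higher; ties count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def sensitivity_specificity(labels, scores, threshold=0.5):
    """Sensitivity and specificity when scores >= threshold are called positive.
    The 0.5 default is an assumption, not a value taken from the paper."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)

# 437 patients, as in the training and validation dataset.
folds = five_fold_indices(437)
for k, test_idx in enumerate(folds):
    # Train on the other four folds, then score the held-out fold.
    train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
    assert len(train_idx) + len(test_idx) == 437
```

Because 437 is not divisible by five, the folds hold 87 or 88 patients each. In practice a stratified split, which keeps the HER2-positive proportion similar in every fold, is a common alternative to a purely random one.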

Five radiomics features were found to correlate with HER2-positive status in BC and used for model construction. Among the machine learning models generated, the best predictive model achieved area under the ROC curve values of 0.893 (95% confidence interval [CI], 0.860–0.920) in the training and validation dataset and 0.854 (95% CI, 0.775–0.927) in the external validation dataset.
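A confidence interval for an AUC, like those reported above, can be estimated in several ways. The sketch below uses a percentile bootstrap, which is one common choice; the abstract does not say which method the authors used, so this is an assumption. The function name `bootstrap_auc_ci`, the class counts, and the synthetic scores are all hypothetical and serve only to make the example runnable.

```python
import random

def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def bootstrap_auc_ci(labels, scores, n_boot=500, alpha=0.05, seed=0):
    """Percentile bootstrap CI: resample patients with replacement,
    recompute the AUC each time, and read off the tail percentiles.
    Assumes the input contains both classes."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        sample = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in sample]
        if 0 < sum(ys) < n:  # skip resamples that lack one of the classes
            stats.append(auc(ys, [scores[i] for i in sample]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper

# Synthetic stand-in for an 88-patient cohort; the 30/58 class split and
# the Gaussian scores are made up for illustration only.
rng = random.Random(1)
labels = [1] * 30 + [0] * 58
scores = [rng.gauss(1.0, 1.0) for _ in range(30)] + \
         [rng.gauss(0.0, 1.0) for _ in range(58)]

point = auc(labels, scores)
low, high = bootstrap_auc_ci(labels, scores)
print(f"AUC {point:.3f} (95% CI, {low:.3f} to {high:.3f})")
```

With a cohort of this size the interval is fairly wide, which is consistent with the external validation interval being broader than the one for the larger training and validation dataset.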

Machine learning models based on ultrasound radiomics features have potential clinical value for predicting HER2-positive status in BC.

## Linked entities

- **Proteins:** ERBB2 (erb-b2 receptor tyrosine kinase 2)
- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}
- **Diseases:** BC (MESH:D001943)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12548876/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12548876/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/PMC12548876/full.md

---
Source: https://tomesphere.com/paper/PMC12548876