# Artificial neural network-based immune biomarker signature predicts pathological complete response to neoadjuvant chemotherapy in HER2-negative breast cancer

**Authors:** Yulong Zhang, Shitang Nong, Suai Deng, Qiyuan Su, Jiao Lu, Dalang Fang, Shihuan Qin, Yanfei Ma

PMC · DOI: 10.3389/fonc.2026.1781380 · Frontiers in Oncology · 2026-03-04

## TL;DR

A new AI model using immune-related genes can predict which HER2-negative breast cancer patients will respond well to chemotherapy before treatment.

## Contribution

Development of an artificial neural network model using five immune-related genes to predict pathological complete response to chemotherapy in HER2-negative breast cancer.

## Key findings

- The ANN model achieved an AUC of 0.858 in training and 0.773 in validation for predicting pCR.
- The five-gene signature correlates with immune cell infiltration in the tumor microenvironment.
- qRT-PCR confirmed the gene signature's association with pCR in pretreatment tumor tissues.

## Abstract

Neoadjuvant chemotherapy (NAC) is widely used in early-stage and locally advanced HER2-negative breast cancer, yet pathological complete response (pCR) is achieved in only a subset of patients. Reliable pretreatment biomarkers for predicting pCR are lacking, particularly for patients treated with standard anthracycline- and taxane-based regimens. Increasing evidence indicates that chemotherapy efficacy is closely linked to the tumor immune microenvironment, suggesting that immune-related molecular signatures may improve response prediction.

A total of 2,385 pretreatment HER2-negative breast cancer patients from ten GEO cohorts were included. GSE194040 (n = 743) was used for training, and nine independent cohorts (n = 1,642) were used for external validation. Differential expression analysis was performed separately in hormone receptor positive and negative subgroups, and genes showing concordant regulation between pCR and non-pCR cases were identified. Weighted gene co-expression network analysis (WGCNA) was applied to detect pCR-associated gene modules. Immune-related genes curated from the ImmPort database were intersected with candidate genes, followed by feature selection using least absolute shrinkage and selection operator regression, random forest, and support vector machine recursive feature elimination. An artificial neural network (ANN) model was constructed based on overlapping features and evaluated using receiver operating characteristic analysis. Immune infiltration was estimated by CIBERSORT, and transcription factor, competing endogenous RNA, and drug enrichment analyses were performed. Key genes were further validated by quantitative real-time PCR in pretreatment tumor tissues.

Five immune-related genes (CCL2, CXCL10, CXCL13, HLA-E, and IGKV1D-8) were identified as robust predictors of pCR and used to build the ANN model. The model achieved an area under the curve of 0.858(95% CI: 0.829–0.888) in the training cohort and 0.773 (95% CI: 0.735–0.808) in the external validation cohorts, demonstrating s predictive performance across independent datasets. High expression of the five-gene signature was associated with increased infiltration of cytotoxic and antigen-presenting immune cells, consistent with an immune-activated tumor microenvironment, and was confirmed by qRT-PCR analysis.

This study establishes a rigorously validated ANN-based immune gene signature for predicting response to neoadjuvant chemotherapy in HER2-negative breast cancer, providing a potential tool for pretreatment risk stratifictableation and individualized therapeutic decision-making.

## Linked entities

- **Genes:** CCL2 (C-C motif chemokine ligand 2) [NCBI Gene 6347], CXCL10 (C-X-C motif chemokine ligand 10) [NCBI Gene 3627], CXCL13 (C-X-C motif chemokine ligand 13) [NCBI Gene 10563], HLA-E (major histocompatibility complex, class I, E) [NCBI Gene 3133], IGKV1D-8 (immunoglobulin kappa variable 1D-8) [NCBI Gene 28904]
- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, IGKV1D-8 (immunoglobulin kappa variable 1D-8) [NCBI Gene 28904] {aka IGKV1D8, L24, L24a}, CXCL10 (C-X-C motif chemokine ligand 10) [NCBI Gene 3627] {aka C7, IFI10, INP10, IP-10, SCYB10, crg-2}, HLA-E (major histocompatibility complex, class I, E) [NCBI Gene 3133] {aka HLA-6.2, QA1}, CXCL13 (C-X-C motif chemokine ligand 13) [NCBI Gene 10563] {aka ANGIE, ANGIE2, BCA-1, BCA1, BLC, BLR1L}, CCL2 (C-C motif chemokine ligand 2) [NCBI Gene 6347] {aka GDCF-2, HC11, HSMCR30, MCAF, MCP-1, MCP1}
- **Diseases:** tumor (MESH:D009369), breast cancer (MESH:D001943)
- **Chemicals:** anthracycline (MESH:D018943), taxane (MESH:C080625)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12995788/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12995788/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/PMC12995788/full.md

---
Source: https://tomesphere.com/paper/PMC12995788