# A computer vision method to evaluate tumor-infiltrating lymphocytes and multiparametric modeling of neoadjuvant systemic therapy response in breast cancer

**Authors:** Mateusz Bielecki, Fang-I Lu, Angeline Vo, Eileen Rakovitch, Katarzyna J. Jerzak, Roberto Salgado, Raffi Karshafian, William T. Tran

PMC · DOI: 10.1177/17588359261417762 · 2026-02-20

## TL;DR

This study uses computer vision and machine learning to predict breast cancer treatment response by analyzing tumor-infiltrating lymphocytes in pre-treatment biopsies.

## Contribution

A novel computer vision method and multiparametric model for predicting neoadjuvant therapy response in breast cancer using TIL spatial features.

## Key findings

- ML models using clinical and graph-based features achieved high predictive accuracy (AUC up to 0.955).
- Ensemble models integrating clinical and graph features outperformed individual models in predicting pCR.
- Significant differences in performance were observed for triple-negative breast cancer models.

## Abstract

Neoadjuvant systemic therapy (NST) is often used to treat locally advanced breast cancer (BC) or patients with early-stage BC at high risk for micrometastatic spread. Pathological complete response (pCR) to NST in BC is associated with excellent prognostic outcomes; however, rates vary significantly. Tumor-infiltrating lymphocytes (TILs) are associated with NST response, suggesting potential as predictive biomarkers.

To develop a computer vision approach to quantify spatial TIL parameters and a multiparametric machine learning (ML) model for predicting NST response.

Retrospective, single institution study of 411 BC patients, combining clinical and graph-level pre-treatment histopathology data to predict response to NST using ML.

Pre-treatment core needle biopsies were prepared, stained with hematoxylin and eosin, and digitized into whole slide images. Convolutional neural networks were applied to segment and classify regions of invasive carcinoma and TILs. Spatial features were extracted based on the coordinates of the TILs within invasive regions, including metrics from Delaunay triangulation, Voronoi diagram analysis, and minimum spanning trees, as well as features capturing cell density and nuclear count. Clinicopathological features were incorporated to support multiparametric modeling. Multiple ML classification models were trained to predict pCR. Logistic regression, K-nearest neighbor, support vector, random forest, Gaussian Naïve Bayes, and extreme gradient boosting models were tested, and model performances were reported.

ML models using clinical and graph-based features achieved high predictive accuracy. The best performing graph feature model reached an area under the receiver operating characteristic curve (AUC) of 0.924. Ensemble models integrating clinical and graph features showed the highest performance, with an AUC of 0.955. Notably, for triple-negative BC, significant differences in predictive performance were demonstrated between clinical and graph feature models (p = 0.026) and between clinical and ensemble models (p = 0.006).

Multiparametric modeling utilizing clinicopathological and graph features obtained from TILs is associated with pCR in BC patients treated with NST.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** FOXP3 (forkhead box P3) [NCBI Gene 50943] {aka AIID, DIETER, IPEX, JM2, PIDX, XPID}, ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, CD68 (CD68 molecule) [NCBI Gene 968] {aka GP110, LAMP4, SCARD1}, NR4A1 (nuclear receptor subfamily 4 group A member 1) [NCBI Gene 3164] {aka GFRP1, HMR, N10, NAK-1, NGFIB, NP10}, PGR (progesterone receptor) [NCBI Gene 5241] {aka NR3C3, PR}, EREG (epiregulin) [NCBI Gene 2069] {aka EPR, ER, Ep}, GZMB (granzyme B) [NCBI Gene 3002] {aka C11, CCPI, CGL-1, CGL1, CSP-B, CSPB}, BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672] {aka BRCAI, BRCC1, BROVCA1, FANCS, IRIS, PNCA4}, CD3D (CD3 delta subunit of T-cell receptor complex) [NCBI Gene 915] {aka CD3-DELTA, CD3DELTA, IMD19, T3D}, ESR1 (estrogen receptor 1) [NCBI Gene 2099] {aka ER, ESR, ESRA, ESTRR, Era, NR3A1}, MS4A1 (membrane spanning 4-domains A1) [NCBI Gene 931] {aka B1, Bp35, CD20, CVID5, FMC7, LEU-16}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, CD48 (CD48 molecule) [NCBI Gene 962] {aka BCM1, BLAST, BLAST1, MEM-102, SLAMF2, hCD48}, CD19 (CD19 molecule) [NCBI Gene 930] {aka B4, CVID3}, SELL (selectin L) [NCBI Gene 6402] {aka CD62L, LAM1, LECAM1, LEU8, LNHR, LSEL}, LCK (LCK proto-oncogene, Src family tyrosine kinase) [NCBI Gene 3932] {aka IMD22, LSK, YT16, p56lck, pp58lck}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, PRF1 (perforin 1) [NCBI Gene 5551] {aka HPLH2, P1, PFP}
- **Diseases:** lung adenocarcinoma (MESH:D000077192), -I Lu (MESH:D006969), malignancies (MESH:D009369), ORCID iDs (MESH:C535742), inflammatory (MESH:D007249), invasive (MESH:D009361), invasive ductal carcinoma (MESH:D044584), BC (MESH:D001943), stage II-III triple-negative (MESH:D064726), pCR (MESH:D005598), NST (MESH:D016609), invasive lobular carcinoma (MESH:D018275), metastasis (MESH:D009362), death (MESH:D003643)
- **Chemicals:** cyclophosphamide (MESH:D003520), luminal (MESH:D010634), Pertuzumab (MESH:C485206), Taxol (MESH:D017239), trastuzumab (MESH:D000068878), paraffin (MESH:D010232), anthracycline (MESH:D018943), Eosin (MESH:D004801), taxane (MESH:C080625), formalin (MESH:D005557), Docetaxel (MESH:D000077143), pembrolizumab (MESH:C582435), 5-fluorouracil epirubicin cyclophosphamide (-), H&amp;E (MESH:D006371), adriamycin (MESH:D004317), carboplatin (MESH:D016190), Hematoxylin (MESH:D006416)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12924930/full.md

---
Source: https://tomesphere.com/paper/PMC12924930