# Towards interpretable prediction of recurrence risk in breast cancer using pathology foundation models

**Authors:** Jakub R. Kaczmarzyk, Sarah C. Van Alsten, Alyssa J. Cozzo, Rajarsi Gupta, Peter K. Koo, Melissa A. Troester, Katherine A. Hoadley, Joel H. Saltz

PMC · DOI: 10.1038/s41746-025-02334-2 · 2026-01-16

## TL;DR

This paper introduces MAKO, a framework using pathology foundation models to predict breast cancer recurrence risk from histopathology images, offering an accessible alternative to transcriptomic assays.

## Contribution

The study introduces MAKO, a novel benchmarking framework for evaluating pathology foundation models in predicting breast cancer recurrence risk.

## Key findings

- Several foundation models outperformed baseline models in predicting ROR-P scores from histopathology images.
- CONCH achieved the highest ROC AUC, while H-optimus-0 and Virchow2 showed the best correlation with ROR-P scores.
- Pathology models stratified patients by recurrence risk similarly to transcriptomic assays, with tumor regions identified as key for high-risk predictions.

## Abstract

Transcriptomic assays such as the PAM50-based ROR-P score guide recurrence risk stratification in non-metastatic, ER-positive, HER2-negative breast cancer but are not universally accessible. Histopathology is routinely available and may offer a scalable alternative. We introduce MAKO, a benchmarking framework evaluating 12 pathology foundation models and two non-pathology baselines for predicting ROR-P scores from H&E-stained whole-slide images using attention-based multiple instance learning. Foundation models, large neural networks pre-trained on millions of pathology images and adaptable to diverse downstream tasks, were trained and validated on the Carolina Breast Cancer Study and externally tested on TCGA BRCA. Several foundation models outperformed baseline models across classification, regression, and survival tasks. CONCH achieved the highest ROC AUC, while H-optimus-0 and Virchow2 showed the top correlation with continuous ROR-P scores. All pathology models stratified CBCS participants by recurrence similarly to transcriptomic ROR-P. Using the HIPPO interpretability method, we found that tumor regions were necessary and sufficient for high-risk predictions, and we identified candidate tissue biomarkers of recurrence. These results highlight the promise of interpretable, histology-based risk models in precision oncology.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672] {aka BRCAI, BRCC1, BROVCA1, FANCS, IRIS, PNCA4}, ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, EREG (epiregulin) [NCBI Gene 2069] {aka EPR, ER, Ep}
- **Diseases:** tumor (MESH:D009369), Breast Cancer (MESH:D001943)
- **Chemicals:** H&amp;E (MESH:D006371)

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12895011/full.md

---
Source: https://tomesphere.com/paper/PMC12895011