# Optimization of guidelines for Risk Of Recurrence/Prosigna testing using a machine learning model: a Swedish multicenter study

**Authors:** Una Kjällquist, Nikos Tsiknakis, Balazs Acs, Sara Margolin, Luisa Edman Kessler, Scarlett Levy, Maria Ekholm, Christine Lundgren, Erik Olsson, Henrik Lindman, Antonios Valachis, Johan Hartman, Theodoros Foukakis, Alexios Matikas

PMC · DOI: 10.1016/j.breast.2025.104489 · The Breast : Official Journal of the European Society of Mastology · 2025-05-07

## TL;DR

This study uses machine learning to improve the selection of postmenopausal breast cancer patients for ROR/Prosigna testing, reducing unnecessary tests and improving treatment accuracy.

## Contribution

The study introduces a machine learning model to optimize patient selection for ROR/Prosigna testing in postmenopausal HR+/HER2- breast cancer patients.

## Key findings

- The machine learning model achieved an AUC of 0.83 in predicting the need for adjuvant chemotherapy based on ROR/Prosigna results.
- The model improved risk stratification and reduced the proportion of patients needing ROR/Prosigna testing compared to current methods.
- Setting cut-offs for risk categories enhanced the accuracy of patient selection for gene expression profiling.

## Abstract

Gene expression profiles are used for decision making in the adjuvant setting in hormone receptor-positive, HER2-negative (HR+/HER2-) breast cancer. While algorithms to optimize testing exist for RS/Oncotype Dx, no such efforts have focused on ROR/Prosigna. This study aims to enhance pre-selection of patients for testing using machine learning.

We included 348 postmenopausal women with resected HR+/HER2-node-negative breast cancer tested with ROR/Prosigna across four Swedish regions. We developed a machine learning model using simple prognostic factors (size, progesterone receptor expression, grade, and Ki67) to predict ROR/Prosigna output and compared the performance regarding over- and undertreatment with commonly employed risk stratification schemes.

Previous classifications resulted in significant undertreatment or large intermediate groups needing gene expression profiling. The machine learning model achieved AUC under ROC of 0.77 in training and 0.83 in validation cohorts for prediction of indication for adjuvant chemotherapy according to ROR/Prosigna. By setting and validating upper and lower cut-offs corresponding to low, intermediate and high-risk disease, we improved risk stratification accuracy and reduced the proportion of patients needing ROR/Prosigna testing compared to current risk stratification.

Machine learning algorithms can enhance patient selection for gene expression profiling, though further external validation is needed.

•Selection of postmenopausal patients with HR+/HER2- N0, breast cancer for gene expression profiling is imprecise.•Common clinical risk definitions result in under- or overtreatment, or large intermediate risk groups, and excessive testing.•We show the feasibility of machine learning algorithms to improve patient selection for testing with ROR/Prosigna.•Further validation in larger external cohorts is needed.

Selection of postmenopausal patients with HR+/HER2- N0, breast cancer for gene expression profiling is imprecise.

Common clinical risk definitions result in under- or overtreatment, or large intermediate risk groups, and excessive testing.

We show the feasibility of machine learning algorithms to improve patient selection for testing with ROR/Prosigna.

Further validation in larger external cohorts is needed.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, LINC-ROR (long intergenic non-protein coding RNA, regulator of reprogramming) [NCBI Gene 100885779] {aka ROR, lincRNA-RoR, lincRNA-ST8SIA3}, NR4A1 (nuclear receptor subfamily 4 group A member 1) [NCBI Gene 3164] {aka GFRP1, HMR, N10, NAK-1, NGFIB, NP10}, PGR (progesterone receptor) [NCBI Gene 5241] {aka NR3C3, PR}
- **Diseases:** breast cancer (MESH:D001943), node (MESH:D012804)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12139505/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12139505/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC12139505/full.md

---
Source: https://tomesphere.com/paper/PMC12139505