# Ovarian cancer recurrence prediction: comparing confirmatory to real-world predictors with machine learning

**Authors:** D. Katsimpokis, A.E.C. van Odenhoven, M.A.J.M. van Erp, H.H.B. Wenzel, M.A. van der Aa, M.M.H. van Swieten, H.P.M. Smedts, J.M.J. Piek

PMC · DOI: 10.1016/j.esmorw.2025.100666 · ESMO Real World Data and Digital Oncology · 2026-01-08

## TL;DR

Machine learning models outperform traditional methods in predicting ovarian cancer recurrence using real-world data from a cancer registry.

## Contribution

This study compares expert-derived and data-driven predictors using ML to improve ovarian cancer recurrence prediction.

## Key findings

- ML models with real-world data outperformed traditional Cox regression in predicting survival.
- XGBoost achieved the highest performance with a c-index of 0.75.
- Treatment variables and socioeconomic status were key predictors not identified by experts.

## Abstract

Ovarian cancer is one of the deadliest cancers in women, frequently diagnosed at an advanced stage, with a 5-year survival rate of 17%-28% in advanced-stage (International Federation of Gynecology and Obstetrics IIB-IV) disease. Machine learning (ML) may provide a better tool for survival prognosis than traditional methods and could provide insight into predictive factors. This study focuses on advanced-stage ovarian cancer and contrasts expert-derived predictive factors with data-driven ones from the Netherlands Cancer Registry (NCR) to predict progression-free survival.

A Delphi questionnaire was conducted to identify 14 predictive factors which were included in the final analysis. ML models (regularized Cox regression, random survival forests, and XGBoost) were used to compare the Delphi expert-based set of variables with a real-world data (RWD) variable set derived from the NCR. A non-regularized Cox model was used as the benchmark.

While regularized Cox models with the RWD variable set outperformed the traditional Cox regression with the Delphi variables (c-index: 0.70 versus 0.64, respectively), XGBoost showed the best performance overall (c-index: 0.75). The most predictive factors for recurrence, not identified by Delphi, were surgery type and debulking results, post-operative chemotherapy administration, number of platinum cycles, and socioeconomic status.

Our results highlight that ML algorithms have higher predictive power compared with the traditional Cox regression. Moreover, RWD from a cancer registry identified more predictive variables than a panel of experts. Overall, these results have important implications for artificial intelligence (AI)-assisted clinical prognosis and provide insight into the differences between AI-driven and expert-based decision making in survival prediction.

•ML outperforms Cox regression in predicting ovarian cancer survival.•Real-world predictors from the NCR outperform expert ones.•XGBoost demonstrated the highest performance with a c-index of 0.75.•Treatment variables and socioeconomic status are key predictors of recurrence.•AI-driven methods show potential to enhance clinical prognostication.

ML outperforms Cox regression in predicting ovarian cancer survival.

Real-world predictors from the NCR outperform expert ones.

XGBoost demonstrated the highest performance with a c-index of 0.75.

Treatment variables and socioeconomic status are key predictors of recurrence.

AI-driven methods show potential to enhance clinical prognostication.

## Linked entities

- **Diseases:** ovarian cancer (MONDO:0005140)

## Full-text entities

- **Diseases:** Ovarian cancer (MESH:D010051), IIB-IV) (MESH:D006011), Cancer (MESH:D009369)
- **Chemicals:** platinum (MESH:D010984)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13040900/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13040900/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC13040900/full.md

---
Source: https://tomesphere.com/paper/PMC13040900