# Explainable machine learning model for predicting cesarean section following induction of labor: Development and external validation using real-world data

**Authors:** Yanan Hu, Xin Zhang, Valerie Slavin, Joanne Enticott, Emily Callander, Onicio Leal-Neto, Onicio Leal-Neto

PMC · DOI: 10.1371/journal.pdig.0001061 · PLOS Digital Health · 2025-11-20

## TL;DR

This study developed and validated an explainable machine learning model to predict the likelihood of cesarean section after labor induction, using real-world data from Australian maternity care.

## Contribution

The study introduces a validated, explainable machine learning model for predicting cesarean section after labor induction using routinely collected maternal data.

## Key findings

- The optimal XGBoost model achieved AUROCs of 0.76 and 0.75 in temporal and geographical validations, respectively.
- Nulliparity, pre-pregnancy BMI, and maternal age were the most influential predictors of cesarean section.
- Women with higher predicted cesarean probabilities had increased inpatient costs and maternal morbidity.

## Abstract

Induction of labor (IOL) is a common yet complex clinical procedure associated with varying risks, including cesarean section (CS). Accurate prediction models may help support more informed, personalized decision-making. This study aimed to develop and validate an explainable machine learning prediction model for CS following IOL. We used population-based administrative perinatal datasets from two Australian states (New South Wales (NSW) and Queensland) covering all births between 2016 and 2019 for model development. Temporal validation was conducted using 2020 births from NSW, and geographical validation using 2016–2018 births from Victoria. We included women with singleton, cephalic, term, live births who attempted IOL and had no prior CS. Seven models (logistic regression, random forest, gradient boosting, LightGBM, XGBoost, CatBoost, and AdaBoost) were developed with hyperparameter tuning and feature selection. Performance was assessed using the area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve, calibration plot (overall and across sociodemographic subgroups), decision curve analysis, Brier Score, and model parsimony. SHAP (SHapley Additive exPlanations) values were used to explain predictor contributions. A total of 180,700 women were included in model development (mean age 31 ± 5 years; CS = 20.8%). The optimal model, developed using XGBoost with ten predictors, achieved AUROCs of 0.76 (95% CI: 0.75–0.77) and 0.75 (95% CI: 0.74–0.76) in temporal (n = 14,527; CS = 22.5%) and geographical (n = 14,755; CS = 19.0%) validations, respectively. The most influential predictors were nulliparity, pre-pregnancy body mass index, and maternal age, while diabetes and hypertension (pre-existing or pregnancy-related) contributed least. Women with higher predicted CS probabilities had increased inpatient costs and maternal morbidity, regardless of actual mode of birth. The final model is accessible via an interactive web application (https://csai-8ccf2690242c.herokuapp.com/). This model demonstrates strong predictive performance using routinely collected maternal factors. Further co-design and implementation research is needed before potential clinical adoption.

An increasing number of pregnant women are having their labor induced. One key concern in this process is the potential need for a cesarean section. To support more personalized and informed decision-making, our study focuses on the first essential step: developing and validating machine learning models that predict the likelihood of cesarean section using routinely collected information available before induction. Leveraging large-scale, real-world data from Australian maternity care, our final model demonstrated strong predictive performance and was designed to be transparent and explainable. We have deployed the best-performing model as a publicly accessible, user-friendly web application (https://csai-8ccf2690242c.herokuapp.com/). This tool provides an individualized prediction and establishes a foundation for future research on clinical implementation, user experience, and real-world impact. Ultimately, it may help guide early treatments, reduce unnecessary obstetric interventions, and improve the efficiency of healthcare resource use.

## Linked entities

- **Diseases:** diabetes (MONDO:0005015)

## Full-text entities

- **Diseases:** diabetes (MESH:D003920), hypertension (MESH:D006973)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12633899/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12633899/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/PMC12633899/full.md

---
Source: https://tomesphere.com/paper/PMC12633899