# Explainable machine learning model to predict 6-month exclusive breastfeeding: a prospective cohort study in Jiangsu, China

**Authors:** Qian Wu, Chintana Wacharasin, Yan Tang

PMC · DOI: 10.1186/s12884-026-08671-8 · BMC Pregnancy and Childbirth · 2026-01-26

## TL;DR

This study uses machine learning to identify factors influencing 6-month exclusive breastfeeding in Jiangsu, China, offering insights to improve breastfeeding support.

## Contribution

An explainable machine learning model with SHAP analysis is developed to predict and interpret 6-month exclusive breastfeeding outcomes.

## Key findings

- XGBoost achieved high performance with an AUC of 0.999 in training and 0.853 in validation.
- Key predictors included breastfeeding intention, subjective norm, and perceived control.
- Only 12.3% of mothers sustained exclusive breastfeeding for 6 months.

## Abstract

Exclusive breastfeeding (EBF) during the first 6 months is globally recommended for optimal maternal and child health. Nevertheless, adherence to this recommendation remains suboptimal in China. This study aimed to identify predictors of 6-month EBF among mothers in Jiangsu Province by developing an explainable machine learning (ML) model within a prospective cohort design.

Between August 2022 and March 2023, postpartum women were recruited through multistage random sampling across hospitals of different levels. Data were collected via structured discharge interviews and three follow-up calls using validated instruments. Least Absolute Shrinkage and Selection Operator (LASSO) regression was applied for feature selection. Four ML algorithms, including Extreme Gradient Boosting (XGBoost), Random Forest, Decision Tree, and Logistic Regression, were compared using tenfold cross-validated area under the receiver operating characteristic curve (AUC) in the training set. The best-performing algorithm was then retrained on the full training set and evaluated in an independent validation set. SHapley Additive exPlanations (SHAP) was applied to enhance interpretability.

A total of 374 mothers completed follow-ups. Fewer than half sustained EBF for 2 months, about one-third for 4 months, and only 12.3% for 6 months. XGBoost showed the highest cross-validated performance (mean AUC = 0.75). After retraining, the XGBoost model achieved an AUC of 0.999 in the full training set and 0.853 in the validation set. SHAP analysis identified the most influential predictors in the following order: breastfeeding (BF) intention, subjective norm, perceived control, BF attitude, BF knowledge, maternal education, and exposure to BF education.

Sustaining EBF for 6 months remains challenging. The XGBoost model, interpreted using SHAP, demonstrated acceptable internal performance. It also yielded exploratory yet informative insights into factors influencing 6-month EBF. These findings generate preliminary evidence that may inform locally relevant EBF support efforts and contribute to the growing body of data-driven EBF studies. External validation is required before considering broader applicability.

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}
- **Diseases:** ovarian cancer (MESH:D010051), BF (MESH:C565501), breast cancer (MESH:D001943), type 2 diabetes (MESH:D003924), maternal illness (MESH:D000079262), lactose intolerance (MESH:D007787), deaths (MESH:D003643), breast milk deficiency (MESH:D061325), weight loss (MESH:D015431), gastrointestinal infections (MESH:D005767), otitis media (MESH:D010033), respiratory infections (MESH:D012141), PIM (MESH:D000309)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12918221/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12918221/full.md

## References

9 references — full list in the complete paper: https://tomesphere.com/paper/PMC12918221/full.md

---
Source: https://tomesphere.com/paper/PMC12918221