# Machine learning–based prediction of IVF/ICSI outcomes in male factor infertility highlighting couple-level BMI

**Authors:** Hu Li, Jie Gao, Yiran Li

PMC · DOI: 10.3389/fendo.2026.1772106 · Frontiers in Endocrinology · 2026-02-10

## TL;DR

This study uses machine learning to predict IVF/ICSI outcomes in couples with male infertility, emphasizing the role of body mass index (BMI) in both partners.

## Contribution

The study introduces a machine learning model that highlights the importance of couple-level BMI in predicting IVF/ICSI outcomes for male infertility.

## Key findings

- A LightGBM model achieved an AUC of 0.857 in predicting IVF/ICSI outcomes.
- Couple-level BMI was strongly associated with model predictions alongside traditional ovarian reserve markers.
- SHAP analysis provided interpretable insights into the model's decision-making.

## Abstract

Most clinical prediction models for assisted reproductive technology focus primarily on female ovarian reserve markers and often under-represent male factors and the metabolic status of both partners. Additionally, traditional parametric models may have limited ability to capture nonlinear patterns within reproductive data. This study aimed to develop and validate a machine learning (ML)–based model to predict clinical pregnancy outcomes in couples with male factor infertility undergoing IVF/ICSI, and to explore model interpretability using Shapley Additive exPlanations (SHAP).

This retrospective study analyzed 2,565 couples undergoing their first IVF/ICSI cycle for male factor infertility at Shanghai First Maternity and Infant Hospital between 2019 and 2025. The cohort was partitioned according to embryo transfer date, with the first 70% of cases assigned to the training set and the remaining 30% reserved as an temporal internal validation set. Feature selection was conducted using LASSO regression within the training set. Seven ML models, including LightGBM and Logistic Regression, were developed and optimized through 5-fold cross-validation. Model performance was evaluated using the area under the curve (AUC), accuracy, Brier score, and decision curve analysis. SHAP was employed to provide a visual interpretation of the optimal model.

Five predictors were selected in the training set: female BMI, male BMI, basal FSH, AMH, and female age. In the temporal validation set, all models demonstrated comparable discriminative performance (AUC range: 0.840–0.857). LightGBM achieved an AUC of 0.857 (95% CI: 0.830–0.882), with an accuracy of 0.775 and specificity of 0.909. DeLong tests indicated no statistically significant differences in AUC between LightGBM and Random Forest (P = 0.918), XGBoost (P = 0.985), or logistic regression (P = 0.067). Based on its overall stability across discrimination, calibration (Brier score = 0.145), and clinical utility, LightGBM was selected for interpretability analysis.

A LightGBM-based prediction model demonstrated reasonable performance for predicting IVF/ICSI outcomes in couples with male factor infertility. Within this dataset, couple-level metabolic features were strongly associated with model predictions alongside traditional ovarian reserve markers. These findings reflect predictive associations rather than causal effects and suggest that metabolic characteristics may warrant consideration in risk stratification and counseling. Prospective studies are needed to determine whether targeted interventions can improve clinical outcomes.

## Full-text entities

- **Genes:** PRL (prolactin) [NCBI Gene 5617] {aka GHA1, pPRL}, SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, AMH (anti-Mullerian hormone) [NCBI Gene 268] {aka MIF, MIS}
- **Diseases:** oligozoospermia (MESH:D009845), uterine malformations (MESH:D014591), intrauterine adhesions (MESH:D000267), PCOS (MESH:D011085), Infertility (MESH:D007246), weight (MESH:D015431), teratozoospermia (MESH:D000072660), obesity (MESH:D009765), overweight (MESH:D050177), male factor infertility (MESH:D007248), chromosomal karyotype abnormalities (MESH:D059786), inflammation (MESH:D007249), HL (MESH:C538324), asthenozoospermia (MESH:D053627)
- **Chemicals:** testosterone (MESH:D013739), progesterone (MESH:D011374), T (MESH:D014316), P (MESH:D010758), E2 (MESH:D004958)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12929144/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12929144/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC12929144/full.md

---
Source: https://tomesphere.com/paper/PMC12929144