Comparative Evaluation of Deep Learning-Based and WHO-Informed Approaches for Sperm Morphology Assessment
Mohammad Abbadi

TL;DR
This study compares a deep learning model and WHO-based criteria for sperm morphology assessment, showing that the AI model offers superior predictive accuracy and clinical utility, potentially improving male fertility evaluations.
Contribution
It introduces a deep learning framework for sperm morphology assessment that outperforms traditional WHO-based criteria in predictive accuracy and clinical utility.
Findings
HuSHeM model shows higher AUC than WHO(+SIRI)
Deep learning improves prediction under class imbalance
Model demonstrates closer calibration and greater clinical benefit
Abstract
Assessment of sperm morphological quality remains a critical yet subjective component of male fertility evaluation, often limited by inter-observer variability and resource constraints. This study presents a comparative biomedical artificial intelligence framework evaluating an image-based deep learning model (HuSHeM) alongside a clinically grounded baseline derived from World Health Organization criteria augmented with the Systemic Inflammation Response Index (WHO(+SIRI)). The HuSHeM model was trained on high-resolution sperm morphology images and evaluated using an independent clinical cohort. Model performance was assessed using discrimination, calibration, and clinical utility analyses. The HuSHeM model demonstrated higher discriminative performance, as reflected by an increased area under the receiver operating characteristic curve with relatively narrow confidence intervals…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSperm and Testicular Function · Ovarian function and disorders · Reproductive Biology and Fertility
