ICSI with surgically retrieved sperm in azoospermia: protocol for a systematic review and meta-analysis of reproductive, perinatal, long-term, and paternal outcomes
Panagiotis Tsiartas, Rocio Montejo, Stavros I. Iliadis, Francisco Guillen‑Grima

TL;DR
This study will compare outcomes of ICSI using surgically retrieved sperm versus ejaculated sperm in men with azoospermia, focusing on reproductive, perinatal, and long-term effects.
Contribution
A systematic review and meta-analysis protocol to evaluate the efficacy and safety of ICSI with surgically retrieved sperm in azoospermia patients.
Findings
The study will assess live birth rates and other outcomes in ICSI using surgically retrieved sperm.
It will evaluate perinatal and long-term offspring outcomes to address current evidence gaps.
The protocol follows PRISMA-P and PRISMA 2020 guidelines for systematic review and meta-analysis.
Abstract
Azoospermia affects ~ 1% of men and represents a major cause of severe male infertility. Intracytoplasmic sperm injection (ICSI) with surgically retrieved sperm from the testis or epididymis enables biological fatherhood in obstructive (OA) and non-obstructive azoospermia (NOA) patients. However, comparative studies versus ejaculated sperm remain fragmented, particularly for neonatal and long-term offspring outcomes. This study aims to address the existing evidence gap regarding the outcomes of ICSI cycles using surgically retrieved versus ejaculated sperm in men with azoospermia by evaluating reproductive, perinatal, long-term offspring, and paternal outcomes. A systematic review and meta-analysis will be conducted following the PRISMA-P and PRISMA 2020 guidelines. Eligible studies will include randomized and non-randomized comparative designs of ICSI using surgically retrieved versus…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —Karolinska Institute
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSperm and Testicular Function · Reproductive Health and Technologies · Ovarian function and disorders
Background and rationale
Azoospermia, defined as the complete absence of spermatozoa in the ejaculate, affects approximately 1% of all men and accounts for approximately 10% to 15% of cases of male infertility [1, 2]. It is classified as either obstructive azoospermia (OA), which is caused by reproductive tract blockage, or non-obstructive azoospermia (NOA), which is caused by impaired spermatogenesis. NOA is the predominant form, representing nearly 60% of cases and posing distinct therapeutic challenges [3].
Advances in assisted reproductive technology (ART), particularly intracytoplasmic sperm injection (ICSI), which was first reported in 1992 [4], have enabled men with azoospermia to achieve biological parenthood. Surgical sperm retrieval techniques including testicular sperm aspiration (TESA), testicular sperm extraction (TESE), microdissection testicular sperm extraction (micro-TESE), and epididymal methods, such as percutaneous epididymal sperm aspiration (PESA) and microsurgical epididymal sperm aspiration (MESA), expand options by facilitating epididymal or testicular sperm use. Micro–TESE, introduced in 1999, improved targeted retrieval while minimizing testicular tissue loss [5].
Despite widespread clinical use, uncertainty persists regarding the effectiveness and safety of ICSI with surgically retrieved sperm compared with ejaculated sperm. Most observational studies [6–14] and prior reviews [15–17] emphasize fertilization and pregnancy outcomes, whereas data on live birth, neonatal outcomes, long‑term offspring health, and paternal harms (e.g., retrieval failure, perioperative complications) remain limited. Evidence fragmentation by azoospermia type and sperm source further impedes clinical guidance.
A prospectively registered, methodologically rigorous systematic review and meta-analysis is warranted to synthesize reproductive, perinatal, long-term offspring, and paternal outcomes and inform patients, clinicians, and policymakers.
Aim
To compare the outcome of ICSI cycles using surgically retrieved sperm versus ejaculated sperm among men with azoospermia and to evaluate reproductive, perinatal, long-term offspring, and paternal outcomes across randomized controlled trials, quasi-experimental studies, and observational designs.
Methods
Registration
The methodological approach of this review will follow the Cochrane Handbook for Systematic Reviews of Interventions (version 6.5) [18]. Reporting will follow PRISMA-P [19] and PRISMA 2020 guidelines [20] (Additional file 1) and is registered in the International Prospective Register of Systematic Reviews (PROSPERO, ID: CRD420251142427, registered 22 September 2025).
Participants
Individuals or couples undergoing ART in which the male partner was diagnosed with OA or NOA and autologous oocytes were used.
Intervention
ICSI using surgically retrieved sperm (TESA, TESE, micro‑TESE, PESA, or MESA), fresh or frozen.
Comparator
ICSI using ejaculated sperm.
Primary outcome
Live birth rate per oocyte pick‑up or per started cycle. This denominator reflects the overall effectiveness of the ICSI treatment strategy, encompassing failed sperm retrieval, fertilization failure, and cycle cancellation, and provides the most comprehensive and patient-centered estimate of treatment success. In contrast, live birth per embryo transfer reflects implantation efficiency but excludes cycles that do not reach transfer, potentially overestimating success rates; it will therefore be analyzed as a secondary outcome.
Secondary outcomes
Fertilization, biochemical/clinical/ongoing pregnancy, miscarriage, multiple pregnancy, live birth per embryo transfer, and cumulative live birth rates; perinatal outcomes (gestational age, preterm birth, birth weight, congenital anomalies, perinatal mortality, neonatal intensive care unit (NICU) admission); long-term offspring outcomes (growth, neurodevelopment, cognition, education, psychiatric, cardiometabolic); and paternal outcomes (retrieval failure, perioperative complications).
Exclusion criteria
The exclusion criteria for this systematic review and meta-analysis are as follows: (1) donor oocyte cycles; (2) mixed patient cohorts in which outcomes for individuals with azoospermia cannot be separately identified; (3) use of round spermatid injection (ROSI), intracytoplasmic morphologically selected sperm injection (IMSI), physiological ICSI (PICSI), or magnetic-activated cell sorting (MACS) as the primary intervention; and (4) use of conventional in vitro fertilization (IVF).
Study design
This review will include randomized controlled trials, quasi-experimental studies, prospective and retrospective cohort studies, and case-control studies providing comparative data relevant to the research question. Non-comparative studies will also be included for paternal outcomes inherent to surgical sperm retrieval that cannot be directly compared with ejaculated sperm. The following study designs will be excluded: case series without comparator groups, registry-only cohorts lacking clinic-based ascertainment (i.e., without direct outcome verification by the treating fertility clinic), conference abstracts lacking sufficient methodological detail, narrative reviews, scoping reviews, and other non-comparative study designs.
Information sources and search strategy
We will search PubMed, Embase (Ovid), Scopus, the Cochrane Central Register of Controlled Trials (CENTRAL), Web of Science, and ProQuest Dissertations and Theses Global [18]. Additional sources will include ClinicalTrials.gov [21], the WHO International Clinical Trials Registry Platform (ICTRP) [22], and BASE (Bielefeld Academic Search Engine) for grey literature [23]. Conference proceedings and meeting abstracts will also be captured through the Web of Science and Scopus. The searches will cover literature published from 1990 to the present, without language restrictions. We selected 1990 as the starting point because it corresponds to the introduction and subsequent clinical implementation of ICSI. The initial search strategy will be developed by the review team in collaboration with an experienced medical librarian. The final strategy will then undergo peer review using the PRESS 2015 checklist [24]. The literature search will be focused on studies including men with azoospermia. To increase sensitivity, related search terms (e.g., cryptozoospermia, hypospermatogenesis, low sperm count, oligoasthenoteratozoospermia, oligozoospermia) will also be included to capture studies in which azoospermic men were represented, even if the term “azoospermia” was not explicitly mentioned. Only studies including azoospermic men will be retained for the final analysis. A full draft search strategy for PubMed is provided in Additional file 2. The full strategies used will be provided in the supplementary files and archived on the Open Science Framework (OSF).
Study selection
Screening and study management will be conducted at Covidence systematic review software (Veritas Health Innovation, Melbourne, Australia). Available at www.covidence.org. Two reviewers will independently screen titles/abstracts and full texts (with blinding enabled during title/abstract screening). Disagreements will be resolved by discussion, with arbitration by a third reviewer when necessary. Inter-rater reliability will be quantified via Cohen’s κ statistic, and reasons for full-text exclusion will be documented. The study selection process will be summarized in a PRISMA 2020 flow diagram [20].
Data extraction
Data will be extracted independently and in duplicate via piloted forms [18]. The extracted information will include the study design, participant and intervention characteristics, and all prespecified outcomes. When data are missing or unclear, the study authors will be contacted for clarification [18]. If only medians and interquartile ranges or ranges are reported, means and standard deviations will be estimated via validated methods [25–28].
Risk of bias in individual studies
The risk of bias will be assessed in a standardized manner by two independent reviewers. For randomized controlled trials, we will use the Revised Cochrane Risk of Bias tool (RoB 2), which evaluates bias arising from the randomization process, deviations from intended interventions, missing outcome data, measurement of outcomes, and selection of the reported results [29]. For non-randomized studies, we will apply the ROBINS-I, which is designed to assess bias in observational studies [30]. Disagreements will be resolved by discussion; a third reviewer will adjudicate if consensus cannot be reached [18].
Statistical analysis
Effect measures
For dichotomous outcomes, risk ratios (RRs) with 95% confidence intervals (CIs) will be used, with adjusted estimates preferred when available. For continuous outcomes, mean differences (MDs) will be calculated; if measurement scales differ, standardized mean differences (SMDs) will be applied. When only medians with interquartile ranges (IQRs) or ranges are reported, these values will be converted to means and standard deviations (SDs) using validated methods.
Eligibility for pooling
Meta-analyses will be conducted when at least three studies report a given outcome. If fewer studies are available or if statistical heterogeneity is excessive, the results will be presented narratively.
Meta-analysis model
Meta-analyses will be performed via random–effects models, with restricted maximum likelihood (REML) estimation and Hartung-Knapp-Sidik-Jonkman (HKSJ) adjustment to account for small-sample uncertainty [31].
Rare events and zero-event studies
For rare outcomes, generalized linear mixed models (GLMMs) or beta-binomial models will be applied in sensitivity analyses. In zero-event studies, continuity corrections proportional to the treatment arm size or exact methods will be used [32]. Sensitivity analyses will also be performed, excluding such studies.
Clustering and repeated cycles
When studies report clustering (e.g., multiple cycles per couple), adjusted estimates will be used preferentially. If unavailable, analyses will be restricted to the first cycle per couple in sensitivity analyses. In multi-arm studies, shared comparator groups will be split, or robust variance estimation (RVE) will be applied to account for statistical dependence [33].
Heterogeneity and meta-regression
Between-study heterogeneity will be assessed via τ^2^, I^2^ with 95% CIs, and prediction intervals. Heterogeneity will be considered substantial if the I^2^ exceeds 75% or if the τ^2^ values are large, as suggested in the established methodological literature [34]. In instances of excessive heterogeneity, quantitative pooling will be deemed inappropriate, and a structured narrative synthesis will be conducted following established best practices [18].
If at least 10 studies are available, meta-regression analyses will be conducted to explore potential sources of heterogeneity. The pre-specified effect modifiers to be investigated include the type of azoospermia (obstructive vs. non-obstructive), site of sperm retrieval (testicular vs. epididymal), use of fresh vs. frozen surgical sperm, embryo stage at transfer, type of embryo transfer (single vs. double), female age, calendar mid-year of study, study design (single-center vs. multicenter), and male factor severity (including endocrine, anatomical, histological, and genetic parameters such as FSH levels, testicular volume, histopathology, and genetic abnormalities such as Klinefelter syndrome or Y-chromosome microdeletions, where reported).
Publication bias
Small-study effects will be examined visually via funnel plots and statistically via Egger’s regression test, provided that a minimum of 10 studies is available. Sensitivity analyses will include trim-and-fill methods to evaluate the potential impact of publication bias.
Certainty of evidence
The GRADE approach [35] will be used to rate certainty across outcomes (live birth, perinatal, long-term, paternal). The assessment considers five key domains: risk of bias, inconsistency, indirectness, imprecision, and publication bias. Based on these criteria, the certainty of evidence will be categorized as high, moderate, low, or very low. A summary of findings will be prepared for each outcome. In addition, sensitivity analyses restricted to studies with low or moderate risk of bias will be conducted to refine the certainty ratings.
Discussion
This protocol addresses a critical gap in reproductive medicine. Although ICSI with surgically retrieved sperm has been widely adopted for more than three decades, uncertainty remains regarding its relative efficacy and safety compared with ejaculated sperm, particularly for neonatal outcomes, long-term offspring health, and paternal surgical risks. Previous reviews [15–17] have focused primarily on fertilization and pregnancy, often neglecting live birth and outcomes beyond the perinatal period. Moreover, prior syntheses rarely distinguished OAs from NOAs or considered retrieval sites and preservation methods.
To our knowledge, this will be the first systematic review to comprehensively evaluate reproductive, perinatal, long-term offspring, and paternal outcomes in azoospermic men undergoing ICSI via a prospectively registered, methodologically rigorous approach. By incorporating randomized and non-randomized evidence, applying state-of-the-art risk of bias tools, and grading certainty with GRADE, this review will provide robust conclusions. Subgroup and meta-regression analyses will enhance applicability and help explain heterogeneity.
The implications extend beyond academic interest: the results will directly inform counseling of couples with azoospermia, guide clinical decision-making regarding surgical techniques, and contribute to ART guideline development. By identifying gaps, particularly long-term offspring outcomes and paternal harm, our review will also help prioritize prospective research.
Supplementary Information
Additional file 1: PRISMA-P and PRISMA 2020 guidelines.
Additional file 2: Example of full search strategy (PubMed).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Sharma M, Leslie SW. Azoospermia. [Updated 2023 Nov 18]. In: Stat Pearls [Internet]. Treasure Island (FL): Stat Pearls Publishing; 2025. PMID: 35201719. Available from: https://www.ncbi.nlm.nih.gov/books/NBK 578191/.35201719 · pubmed ↗
- 2Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, et al, editor(s). Cochrane Handbook for Systematic Reviews of Interventions version 6.5 (updated August 2024). Cochrane, 2024. Available from: https://www.cochrane.org/handbook.
- 3Organization WH. International Clinical Trials Registry Platform (ICTRP): mission and vision: World Health Organization; [Available from: https://www.who.int/clinical-trials-registry-platform.
- 4Schünemann HJ, Oxman AD, Brozek J, Glasziou P, Jaeschke R, Vist GE, Williams JW Jr, Kunz R, Craig J, Montori VM, Bossuyt P, Guyatt GH; GRADE Working Group. Grading quality of evidence and strength of recommendations for diagnostic tests and strategies. BMJ. 2008;336(7653):1106–10. 10.1136/bmj.39500.677199.AE. Erratum in: BMJ. 2008;336(7654). 10.1136/bmj.a 139. Schünemann, A Holger J [corrected to Schünemann, Holger J]. PMID: 18483053; PMCID:PMC 2386626.10.1136/bmj.39500.677199.AEPMC 238662618483053 · doi ↗ · pubmed ↗
