Systematic review of risk prediction models for surgical site infection after abdominal surgery in adults
Yating Xu, Juecen Liu, Yao Chen, Meixuan Song, Xianrong Li

TL;DR
This systematic review evaluates risk prediction models for surgical site infections after abdominal surgery, finding many with good performance but high bias.
Contribution
The study provides a comprehensive overview of existing models and highlights the need for improved validation and generalizability.
Findings
25 out of 28 models showed good predictive performance (AUC > 0.7).
All studies had a high risk of bias.
Common predictors included surgical duration, diabetes, BMI, and ASA score.
Abstract
To systematically review risk prediction models for surgical site infection (SSI) after abdominal surgery and to provide a reference for clinical risk management. A comprehensive search was conducted in Web of Science, Cochrane Library, PubMed, Sinomed, Chinese Medical Journal Full-text Database, CNKI, VIP, and Wanfang Data for studies published from January 1, 1980, to August 12, 2024. Two researchers independently screened the literature, extracted data, and assessed the risk of bias and applicability of the models. A total of 25 studies were included, involving 28 SSI risk prediction models after abdominal surgery. Among them, 25 models showed good predictive performance (AUC > 0.7), but all studies exhibited a high risk of bias. The most frequently included predictors were surgical duration, diabetes, BMI (body mass index), serum albumin levels, ASA (American Society of…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1| Included study | Year | Country | Model type | Study population | Study design | Data source | Total sample size | Outcome indicator | SSI observation period | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Outcome event | Total cases | Incidence rate | |||||||||
| de Oliveira et al. ( | 2006 | Brazil | a | Patients undergoing gastrointestinal surgery | A | Clinical data | 149 | 609 | 24.50% | SSI | From admission to 30 days postoperatively |
| Gervaz et al. ( | 2012 | Switzerl | a + b | Patients undergoing colorectal resection | A | Clinical data | 114 | 534 | 21.30% | SSI | From admission to 30 days postoperatively |
| Hedrick et al. ( | 2013 | The United States | a + b | Patients undergoing elective intra-abdominal colorectal surgery | B | ACS NSQIP | 1719 | 18,403 | 9.34% | Superficial/deep SSI | From admission to 30 days postoperatively |
| Tu et al. ( | 2016 | China | a | Patients undergoing laparoscopic gastrectomy | B | Electronic medical record (EMR) system | 131 | 2,364 | 5.50% | SSI | From admission to 30 days postoperatively |
| Li et al. ( | 2017 | China | a + b | Patients undergoing hepatectomy with hepaticojejunostomy | B | Electronic medical record (EMR) system | 34 | 335 | 10.15% | Space/organ SSI | Within 30 days postoperatively |
| Liu et al.( | 2017 | China | a + b | Patients undergoing abdominal surgery | B | Hospital information system (HIS) | 362 | 5,067 | 7.14% | SSI | — |
| Zhao et al. ( | 2018 | China | a + b | Patients undergoing abdominal surgery | B | Clinical data | 84 | 2,931 | 2.87% | Surgical incision infection | — |
| Tianyu et al. ( | 2019 | China | a + c | Patients undergoing radical hepatectomy for hepatocellular carcinoma | B | Electronic medical record (EMR) system | 149 | 640 | 23.20% | SSI | — |
| Dang et al. ( | 2020 | Canada | a + c | Patients undergoing laparoscopic Roux-en-Y gastric bypass/laparoscopic sleeve gastrectomy | B | MBSAQIP | 1841 | 274,187 | 0.70% | SSI | Within 30 days postoperatively |
| Okui et al. ( | 2020 | Japan | a + b | Patients undergoing gastrointestinal or hepatobiliary tract cancer resection | B | Electronic medical record (EMR) system | 106 | 1,578 | 6.70% | Space/organ SSI | — |
| Kamada et al. ( | 2021 | Japan | a | Patients undergoing elective stoma reversal | B | Medical records | 53 | 182 | 29.00% | Superficial/deep SSI | Within 30 days postoperatively |
| Sun et al. ( | 2021 | China | a | Patients undergoing radical resection for gastrointestinal cancer | A | RCT | 51 | 839 | 6.10% | Space/organ SSI | Within 30 days postoperatively |
| Lin et al. ( | 2021 | China | a + b | Patients undergoing surgery for gastrointestinal perforation | B | Clinical data | 53 | 170 | 31.20% | Surgical incision infection | — |
| Wang et al. ( | 2021 | China | a + c | Patients undergoing colorectal cancer surgery | B | Clinical data | 18 | 85 | 21.17% | SSI | — |
| Pei et al. ( | 2022 | China | a | Patients undergoing radical colorectal cancer surgery | B | Clinical data | 46 | 402 | 11.40% | Intra-abdominal infection | Within 30 days postoperatively |
| Bu et al.( | 2022 | China | a + b | Patients undergoing colorectal cancer surgery | B | Clinical data | 40 | 413 | 9.70% | SSI | Within 30 days postoperatively |
| Gao et al. ( | 2022 | China | a + b | Patients undergoing emergency surgery for intestinal obstruction | B | Clinical data | 38 | 204 | 18.63% | Surgical incision infection | — |
| Huang et al. ( | 2022 | China | a + b | Patients undergoing radical gastrectomy | B | Clinical data | 66 | 355 | 18.60% | SSI | Within 30 days postoperatively |
| Xu et al. ( | 2022 | China | a + b | Patients undergoing elective surgery for gastrointestinal/hepatobiliary/pancreatic tumors | B | Clinical data | 94 | 982 | 9.60% | Space/organ SSI | — |
| Zhang et al. ( | 2023 | China | a | Patients undergoing abdominal surgery | B | Hospital information system (HIS) | 150 | 3,018 | 5% | SSI | — |
| Fu et al. ( | 2023 | China | a | Patients undergoing radical colorectal cancer surgery | B | Electronic medical record (EMR) system | 45 | 242 | 18.60% | Surgical incision infection | — |
| Li et al. ( | 2023 | China | a | Patients undergoing colorectal cancer surgery | B | Electronic medical record (EMR) system | 34 | 398 | 8.54% | Surgical incision infection | — |
| Miao et al. ( | 2023 | China | a + c | Patients with colon cancer complicated by intestinal obstruction | B | Clinical data | 143 | 833 | 17.16% | Surgical incision infection | Within 30 days postoperatively |
| Wang et al. ( | 2023 | China | a | Patients undergoing colorectal cancer surgery | B | Clinical data | 26 | 104 | 25% | Surgical incision infection | — |
| Xu et al. ( | 2023 | China | a + b | Patients undergoing surgery for gastric and colorectal cancer | B | Electronic medical record (EMR) system | gastric cancer 30; colorectal cancer 29 | gastric cancer 356; | gastric cancer 8.4%; colorectal cancer 9.5% | Space/organ SSI | Within 30 days postoperatively |
| Yu et al. ( | 2024 | China | a + b | Patients undergoing radical gastrectomy | B | Clinical data | 27 | 322 | 8.30% | Intra-abdominal infection | — |
| Included study | Model performance | Model validation method | Model presentation | ||
|---|---|---|---|---|---|
| AUC (model development/internal validation/external validation) | Calibration method | Specificity/sensitivity/accuracy | |||
| de Oliveira et al. ( | 0.732/—/— | Hosmer–Lemeshow test | — | — | Risk index/risk score scale |
| Gervaz et al. ( | 0.729/0.703/— | — | — | Internal cross-validation | Risk scoring system |
| Hedrick et al. ( | Model 1.(0.64/—/—) | Calibration plot | — | Internal validation (bootstrap) | Nomogram |
| Tu et al. ( | 0.734/—/— | Hosmer–Lemeshow test | — | Internal random split validation | Risk scoring system |
| Li et al. ( | 0.851/0.850/— | — | 74.1%/85.3%/— | Internal validation (bootstrap) | Nomogram |
| Liu et al.( | 0.803/—/— | Hosmer–Lemeshow test | 67.99%/74.83%/68.49% | Internal random split validation | — |
| Zhao et al. ( | 0.735/—/— | — | — | Internal validation (bootstrap) | Nomogram |
| Tianyu et al. ( | 0.85/0.86/0.84 | — | — | External validation | Nomogram |
| Dang et al. ( | 0.74/—/0.73 | — | 99.95%/48%/— | External validation | Risk scoring formula with |
| Okui et al. ( | POD1all:0.751/—/— | — | POD3all:81%/85%/— | Internal cross-validation | Nomogram |
| Kamada et al. ( | 0.773/—/— | — | — | — | Risk scoring system |
| Sun et al. ( | 0.78/—/— | Hosmer–Lemeshow test | — | — | Risk stratification |
| Lin et al. ( | 0.834/—/— | Calibration curve | — | Internal validation (bootstrap) | Nomogram |
| Wang et al. ( | 0.869/—/— | Hosmer–Lemeshow test | 74%/76%/— | External validation | Risk scoring formula with |
| Pei et al. ( | 0.931/—/— | Calibration curve | — | — | Nomogram |
| Bu et al.( | 0.862/0.873/— | Calibration curve | — | Internal random split validation | Nomogram |
| Gao et al. ( | 0.786/—/— | Hosmer–Lemeshow test | 66.87%/84.21%/— | Internal validation | Risk scoring formula with |
| Huang et al. ( | 0.826/—/— | Calibration curve | — | Internal validation (bootstrap) | Nomogram |
| Xu et al. ( | 0.754/0.708/— | Hosmer–Lemeshow test | 52.17%/93.55%/— | Internal random split validation | Nomogram |
| Zhang et al. ( | 0.926/—/— | — | — | — | Nomogram |
| Fu et al. ( | 0.806/—/— | — | 73.6%/80%/— | — | Tree diagram |
| Li et al. ( | 0.905/—/— | — | 85.62%/87.72%/— | — | Risk scoring formula with |
| Miao et al. ( | 0.892/—/0.786 | Hosmer–Lemeshow test | 80.77%/92.31%/— | External validation | Nomogram |
| Wang et al. ( | 0.928/—/— | Hosmer–Lemeshow test | — | — | Risk scoring formula with |
| Xu et al. ( | Model 1.(0.808/—/—) | — | — | Internal validation (bootstrap) | Nomogram |
| Yu et al. ( | 0.933/0.951/— | Hosmer–Lemeshow test | — | Internal validation (bootstrap) | Nomogram |
| Included studies | Modeling method | Number of models | Missing data | Method for handling continuous variables | Number of predictors | Final included predictors | |
|---|---|---|---|---|---|---|---|
| Number | Handling method | ||||||
| de Oliveira et al. ( | Logistic regression | 1 | 36 | — | Categorical variables | 4 | Obesity, surgical risk, adjusted operative time, laparoscopic surgery |
| Gervaz et al. ( | Logistic regression | 1 | — | — | Categorical variables | 4 | Obesity, wound classification III-IV, open surgery, ASA score III-IV |
| Hedrick et al. ( | Logistic regression | 3 | — | Multiple imputation | Continuous variables | 8 | Open surgery, BMI, preoperative hematocrit, ASA score, smoking, alcohol use, preoperative functional status, age > 75 years |
| Tu et al. ( | Logistic regression | 1 | — | — | Categorical variables | 4 | BMI, intraoperative blood loss, operative time, perioperative transfusion |
| Li et al. ( | Logistic regression | 1 | — | — | Categorical variables | 4 | Bile duct stones, intraoperative blood loss >1,500 mL, history of abdominal surgery, bile leakage |
| Liu et al.( | Logistic regression | 1 | — | — | Categorical variables | 8 | Laparoscopic surgery, wound classification, operative time, diabetes, hypoalbuminemia, hypertension, preoperative inflammatory response, ASA score |
| Zhao et al. ( | Logistic regression | 1 | — | Deletion | Categorical variables | 5 | Diabetes, age, malnutrition, emergency surgery, lack of postoperative health education |
| Tianyu et al. ( | Logistic regression | 1 | — | — | Continuous variables | 4 | ASA score, operative time, serum albumin level, repeat hepatectomy |
| Dang et al. ( | Logistic regression | 1 | — | — | Categorical variables | 9 | BMI, operative time, diabetes, hypertension, Roux-en-Y gastric bypass, long-term use of corticosteroids or immunosuppressants, sex, gastroesophageal reflux disease, obstructive sleep apnea |
| Okui et al. ( | Logistic regression | 3 | 47/54/108 | Deletion | Continuous variables | 5 | White blood cell count, platelet count, albumin, C-reactive protein, estimated glomerular filtration rate |
| Kamada et al. ( | Logistic regression | 1 | — | — | Continuous variables | 3 | Subcutaneous fat thickness ≥20 mm, stoma duration ≤20 weeks, SSI after primary surgery |
| Sun et al. ( | Logistic regression | 1 | — | — | Categorical variables | 5 | Gastrectomy, colorectal resection, pancreatoduodenectomy, anesthesia time >4 h, prolonged ICU stay |
| Lin et al. ( | Logistic regression | 1 | — | — | Categorical variables | 5 | Operative time ≥2 h, unwashed incision, BMI ≥ 24 kg/m2, no preoperative antibiotics, intraoperative incision extension |
| Wang et al. ( | Logistic regression | 1 | — | — | Categorical variables | 9 | BMI, operative time, diabetes, wound classification, age, serum albumin level, Dukes stage, type of surgery, probiotic supplementation |
| Pei et al. ( | Logistic regression | 1 | — | — | Categorical variables | 4 | Preoperative serum albumin level, lymphocyte-to-white blood cell ratio < 0.17, low subcutaneous fat mass, low skeletal muscle mass |
| Bu et al.( | Logistic regression | 1 | — | — | Categorical variables | 6 | Open surgery, diabetes, male sex, BMI ≥ 28 kg/m2, preoperative moderate anemia, preoperative neoadjuvant chemotherapy |
| Gao et al. ( | Logistic regression | 1 | — | — | Categorical variables | 3 | Surgery duration ≥ 2 h, age ≥ 65 years, albumin < 28 g/L |
| Huang et al. ( | Logistic regression | 1 | — | — | Categorical variables | 8 | BMI, surgery duration, diabetes, age, body temperature upon entering the operating room, intermittent warming time, intraoperative lowest temperature, surgical approach |
| Xu et al. ( | Logistic regression | 1 | — | — | Categorical variables | 5 | White blood cell count, neutrophil-to-lymphocyte ratio, total bilirubin, C-reactive protein, mean body temperature |
| Zhang et al. ( | Logistic regression | 1 | — | — | Categorical variables | 11 | Antibiotic use, NRS 2002 score ≥ 3, NNIS score ≥ 2, procalcitonin ≥ 0.05 μg/L, low-density lipoprotein < 3.37 mmol/L, general anesthesia, blood loss ≥ 200 mL, surgical site, wound class III, season of surgery, diabetes |
| Fu et al. ( | Decision tree model | 1 | — | — | Categorical variables | 5 | Intraoperative blood loss, surgery duration, diabetes, serum albumin level, TNM stage |
| Li et al. ( | Logistic regression | 1 | — | — | Categorical variables | 3 | Hypoproteinemia, BMI ≥ 24 kg/m2, surgery duration ≥ 3 h |
| Miao et al. ( | Logistic regression | 1 | — | — | Categorical variables | 6 | Open surgery, diabetes, surgery duration ≥ 2 h, age ≥ 60 years, postoperative indwelling catheter ≥ 3 days, neoadjuvant chemotherapy |
| Wang et al. ( | Logistic regression | 1 | — | — | Continuous variables | 2 | Diabetes, drainage tube indwelling time |
| Xu et al. ( | Logistic regression | 2 | — | — | Categorical variables | 4/3 | Gastric cancer: ASA score, intraoperative blood loss, sex, combined organ resection; Colorectal cancer: ASA score, surgery duration, combined organ resection |
| Yu et al. ( | Logistic regression | 1 | — | — | Continuous variables | 6 | BMI, intraoperative blood loss, surgery duration, serum albumin level, blood glucose level, hemoglobin |
| Included studies | Risk of bias | Applicability | Overall | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Participants | Predictors | Outcome | Data analysis | Participants | Predictors | Outcome | Risk of bias | Applicability | |
| de Oliveira et al. ( | − | ? | + | + | + | + | + | + | + |
| Gervaz et al. ( | + | − | − | + | + | + | + | + | + |
| Hedrick et al. ( | + | ? | + | + | + | + | + | + | + |
| Tu et al. ( | + | ? | ? | + | + | + | + | + | + |
| Li et al. ( | + | ? | ? | + | + | + | + | + | + |
| Liu et al.( | + | + | + | + | + | + | + | + | + |
| Zhao et al. ( | + | ? | ? | + | + | + | + | + | + |
| Tianyu et al. ( | + | + | + | + | + | + | + | + | + |
| Dang et al. ( | − | + | ? | + | + | + | + | + | + |
| Okui et al. ( | + | + | + | + | + | + | + | + | + |
| Kamada et al. ( | + | + | + | + | + | + | + | + | + |
| Sun et al. ( | − | ? | ? | + | + | + | + | + | + |
| Lin et al. ( | + | + | + | + | + | + | + | + | + |
| Wang et al. ( | + | + | + | + | + | + | + | + | + |
| Pei et al. ( | + | + | ? | + | + | + | + | + | + |
| Bu et al.( | + | + | ? | + | + | + | + | + | + |
| Gao et al. ( | + | + | + | + | + | + | + | + | + |
| Huang et al. ( | + | + | ? | + | + | + | + | + | + |
| Xu et al. ( | + | ? | ? | + | + | + | + | + | + |
| Zhang et al. ( | + | + | + | + | + | + | + | + | + |
| Fu et al. ( | + | + | + | + | + | + | + | + | + |
| Li et al. ( | + | + | + | + | + | + | + | + | + |
| Miao et al. ( | + | + | ? | + | + | + | + | + | + |
| Wang et al. ( | + | + | ? | + | + | + | + | + | + |
| Xu et al. ( | + | + | ? | + | + | + | + | + | + |
| Yu et al. ( | + | + | ? | + | + | + | + | + | + |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical site infection prevention · Cardiac, Anesthesia and Surgical Outcomes · Colorectal Cancer Surgical Treatments
Introduction
1
Surgical site infection (SSI) is a common postoperative complication and represents an important component of hospital-acquired infections. According to the 1999 guidelines for the prevention of surgical site infection issued by the Centers for Disease Control and Prevention (CDC), SSI is defined as an infection occurring within 30 days after surgery—or within 1 year if an implant is placed—that affects the incision or deep tissues and is associated with the surgical procedure (1). As one of the most common healthcare-associated infections, SSI poses a serious threat to patient outcomes and quality of life. Abdominal surgery carries an especially high risk of SSI due to factors such as abundant subcutaneous fat, extensive operative fields, prolonged operative time, and the presence of a complex intestinal microbiota, with reported incidence rates ranging from 7.5 to 26.7% (2, 3). SSI not only exacerbates patient suffering and delays recovery but also markedly increases healthcare costs and mortality risk (4, 5). Evidence indicates that among general surgery patients, SSI is associated with an average increase of approximately 10,114.5 CNY in hospitalization expenses, a prolonged hospital stay by 13 days, and a 2- to 11-fold higher risk of death compared with uninfected patients (6, 7). Importantly, up to 60% of SSIs are preventable through evidence-based infection control measures (7). Therefore, preoperative identification of high-risk patients and implementation of tailored interventions are crucial for reducing SSI incidence.
In recent years, numerous predictive models for postoperative SSI in abdominal surgery have been developed worldwide to facilitate early risk assessment. However, considerable heterogeneity exists across these models in terms of methodological quality, predictive performance, and clinical applicability, and no systematic review has yet synthesized and compared their overall performance. To address this gap, the present study aims to systematically evaluate risk prediction models for postoperative SSI in patients undergoing abdominal surgery, with the goal of providing clinicians with robust tools for risk stratification and offering a foundation for the refinement and broader application of such models.
Materials and methods
2
Problem formulation
2.1
This study was conducted in accordance with the TRIPOD-SRMA reporting guideline for systematic reviews of prediction models (8). The research question was defined using the PICOTS framework proposed by the Cochrane Prognosis Methods Group (9):
① P (Population): adult patients (≥18 years) undergoing abdominal surgery;
② I (Intervention model): development, validation, or updating of a prediction model;
③ C (Comparator): none;
④ O (Outcome): occurrence of SSI;
⑤ T (Timing): within 30 days postoperatively;
⑥ S (Setting): inpatient settings where models are applied to predict postoperative SSI.
Inclusion and exclusion criteria
2.2
Inclusion criteria
2.2.1
(1) Study population: adult patients (≥18 years) undergoing abdominal surgery;
(2) Content: development of a risk prediction model for postoperative SSI;
(3) Study design: cross-sectional, case–control, or cohort studies;
(4) Outcome: incidence of SSI;
(5) Language: Chinese or English.
Exclusion criteria
2.2.2
(1) Duplicate publications;
(2) Studies not involving prediction model development;
(3) Models developed solely from systematic reviews or meta-analyses;
(4) Studies including fewer than two predictors;
(5) Animal studies, narrative reviews, conference abstracts, and similar publications;
(6) Studies with unavailable full texts;
(7) Dissertations or theses.
This review was registered in PROSPERO (CRD42024576543).
Literature search strategy
2.3
Electronic databases including Web of Science, Cochrane Library, PubMed, Sinomed, Chinese Biomedical Literature Database (CBM), CNKI, VIP, and Wanfang were systematically searched from January 1, 1980, to August 12, 2024, to identify studies on prediction models for postoperative SSI following abdominal surgery.
Chinese search terms included: abdomen, stomach, intestine, hepatobiliary, pancreas and spleen, abdominal hernia, gynecology and obstetrics, urology, surgical site infection, surgical wound infection, incision infection, risk prediction model, risk score and nomogram.
English search terms included: abdomen, Abdom, gastrointestinal, colorectal, hepatopancreatobiliary, laparotomy, gynecolog*, urolog*, gastr*, liver, surgical wound infection, surgical site infection (SSI), incision infection, prediction model, risk prediction model, risk score, nomogram, risk stratification, prognos* model, forecasting model*.
Study selection and data extraction
2.4
Two reviewers independently screened the literature and extracted data in accordance with the predefined eligibility criteria. Any discrepancies were resolved through consultation with a third reviewer. Data extraction was performed using a standardized form designed with reference to the checklist for systematic reviews of prediction models proposed by Moons et al. (10).
Quality assessment
2.5
The methodological quality of included studies was appraised using the Prediction Model Risk of Bias Assessment Tool (PROBAST) (11), which evaluates both risk of bias and applicability. Risk of bias was assessed across four domains: participants, predictors, outcomes, and analysis. Applicability was judged with respect to participants, predictors, and outcomes. Each item was rated as yes, probably yes, no, probably no, or unclear. Overall risk of bias was categorized as low, high, or unclear. A domain was considered high risk if any item was rated no or probably no. A study was deemed low risk only if all domains were judged to be at low risk. Quality assessment was independently conducted by two reviewers, and disagreements were resolved by a third reviewer.
Results
3
Literature search
3.1
A total of 2,508 records were initially retrieved. After stepwise screening, 26 studies involving 315,858 patients undergoing abdominal surgery were included. The study selection process is presented in Figure 1.
Flowchart of literature screening.
Characteristics of included studies
3.2
A total of 26 studies were included, of which seven (12–18) were published in the past 3 years. Most studies were conducted in China (n = 20) (12–31), while the remaining six (32–37) originated from Brazil, Switzerland, the United States, Canada, and Japan. The sample size of individual studies ranged from 85 to 315,858 patients, and the number of predictors included in the models ranged from 3 to 11. Three studies (24, 32, 33) were prospective cohort studies, while the remaining 23 (12–23, 25–31, 34–37) were retrospective cohort studies. The basic characteristics of the included studies are summarized in Table 1.
Characteristics of risk prediction models
3.3
In total, 31 prediction models for postoperative SSI following abdominal surgery were identified. The reported AUC values ranged from 0.64 to 0.933, with 28 models achieving an AUC greater than 0.70, indicating good discriminatory ability. Ten studies (13–15, 20, 21, 26, 29, 31, 35, 36) reported both specificity and sensitivity, with specificity ranging from 52.17 to 99.95% and sensitivity ranging from 48 to 93.55%. Fifteen models (15, 16, 18, 19, 21, 24–32, 34) assessed calibration, while four studies (15, 23, 26, 35) performed external validation. Internal validation was reported in 14 studies (17–22, 25, 28–31, 33, 34, 36), most commonly using bootstrapping or split-sample methods (See Table 2 for details). The number of predictors included per model ranged from 3 to 11. Frequently reported predictors were operative time, diabetes, body mass index (BMI), serum albumin level, American Society of Anesthesiologists (ASA) score, age, intraoperative blood loss, incision type, and open surgery (see Table 3).
Quality assessment of included studies
3.4
Risk of bias assessment
3.4.1
All studies were judged to be at high risk of bias, indicating methodological shortcomings in both the model development and validation phases (see Table 4).
In the participants domain, 23 studies (12–23, 25–31, 34–37) were retrospective in design and therefore considered at high risk of bias. Five studies (12, 21, 33, 34, 36) applied relatively broad inclusion and exclusion criteria, leading to high risk of bias.
In the predictors domain, 22 studies (12–21, 23, 25–30, 32, 34–37) lacked sufficient information to determine whether predictor definitions and assessment criteria were consistent across participants, resulting in unclear risk of bias. In addition, 21 studies (12–18, 21–31, 35–37) did not clarify whether predictor assessors were blinded, also leading to unclear risk.
In the outcomes domain, four studies (23, 32, 34, 37) applied inappropriate SSI classifications and were judged to be at high risk of bias. Another four (13, 14, 21, 26) did not report classification methods, and three (12–14) failed to define SSI explicitly, all rated as unclear risk. One study (23) applied an unreasonable SSI definition, rated as high risk. Furthermore, 23 studies (12–31, 35–37) did not specify whether outcome assessors were blinded, resulting in unclear risk of bias.
In the analysis domain, nine studies (16, 18, 20, 25, 26, 29, 32, 33, 37) had inadequate sample sizes for model development, rated as high risk. Two studies (32, 33) inappropriately handled continuous and categorical predictors, and 18 studies (12–18, 21–23, 25–29, 34, 35, 37) lacked sufficient information regarding variable handling, leading to high risk. Three studies (31, 32, 36) directly excluded cases with missing data, considered high risk, while 20 studies (12–16, 18–29, 33, 35, 37) did not report missing data at all, indicating inadequate reporting. In addition, 21 studies (12–15, 17–20, 22–25, 27–30, 32, 33, 35–37) relied on univariate analyses for predictor selection, regarded as high risk. Eleven studies (12–14, 17, 20, 22, 23, 33, 35–37) did not report model calibration methods, resulting in unclear risk. Six studies (13, 19, 20, 22, 34, 36) did not assess model updating or adjustment, rated as high risk. Fourteen studies (12, 14–16, 18, 23, 24, 26, 27, 29, 31, 32, 35, 37) did not report model fit issues, leaving the risk unclear.
Applicability assessment
3.4.2
All studies focused on adult patients (≥18 years) undergoing abdominal surgery, and the target populations were consistent across studies, indicating good applicability.
Discussion
4
Surgical site infection (SSI) is one of the most common complications following abdominal surgery and is closely associated with patient mortality, complication risk, and healthcare costs (38). Developing accurate risk prediction models can facilitate early identification of high-risk patients and enable timely interventions, thereby reducing the incidence of SSI. Among the 31 models included in this study, most demonstrated satisfactory predictive performance (AUC > 0.7); however, significant deficiencies remain in model development and reporting quality, highlighting the need for further optimization.
Risk prediction models for post-abdominal surgery SSI are still in an exploratory stage
4.1
Currently, the development of SSI risk prediction models after abdominal surgery remains preliminary, with several methodological limitations. First, among the 26 included studies, most were single-center, and only three (24, 32, 33) were prospective; the remainder were retrospective, which may limit the reliability and validity of the datasets. Second, many studies did not rigorously adhere to blinding procedures during predictor assessment and outcome evaluation, potentially introducing bias and affecting model accuracy. Additionally, according to sample size principles for predictive modeling, each predictor variable should typically include 10–20 outcome events (39); however, nine studies (16, 18, 20, 25, 26, 29, 32, 33, 37) had insufficient sample sizes.
Furthermore, variable selection methods remain largely traditional, often relying on univariate analysis, which may omit important predictors or include irrelevant variables, compromising model robustness. Novel approaches such as LASSO regression, SelectKBest, and genetic algorithms (GA) have demonstrated higher efficiency in various modeling contexts (40) and could be considered in future studies. Most studies also failed to report methods for handling missing values and predictor variables; three studies (31, 32, 36) excluded missing data directly, and 18 studies (12–18, 21–23, 25–29, 34, 35, 37) provided no information regarding predictor handling, resulting in a high risk of bias that may impair model generalizability.
Finally, only four studies (15, 23, 26, 35) conducted external validation. The lack of external validation in most models makes it difficult to assess whether their predictive performance is stable, thereby limiting generalizability and clinical applicability. Moreover, the external validation cohorts in most studies were drawn from the same institutions as the development cohorts, emphasizing reproducibility but not model transportability (41). Although some studies reported discrimination (e.g., AUC or C-index), calibration (e.g., Hosmer–Lemeshow test), and clinical utility, overall performance reporting was insufficient, and some models may be overfitted, restricting broader application. Most predictive models included in this study primarily reported the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity as their main performance metrics, whereas indicators better suited for imbalanced outcomes, such as the area under the precision–recall curve (AUC-PR) and F1 score, were rarely reported. Only a few studies further evaluated the clinical utility or overall predictive accuracy of the models. For instance, Bu et al. (28) employed decision curve analysis (DCA) to assess the net clinical benefit of the model, while Huang et al. (30) reported a Brier score of 1.14 to reflect the overall agreement between predicted probabilities and observed outcomes. In summary, risk prediction models for post-abdominal surgery SSI are still in an early developmental stage, and future efforts should comprehensively improve data sources, modeling design, validation strategies, and reporting quality.
Heterogeneity in SSI incidence due to variations in definitions and diagnostic criteria
4.2
The incidence of surgical site infection (SSI) reported in the included studies ranged from 0.7 to 31.2%, indicating substantial heterogeneity across studies, likely attributable to differences in SSI definitions, classification, and diagnostic criteria. A standardized international definition of SSI is currently lacking, and most studies have referenced the prevention guidelines issued by the Centers for Disease Control and Prevention (CDC) (1). Among the included studies, 20 were conducted in China, where SSI was primarily defined according to the 2001 Chinese Ministry of Health’s “Diagnostic Criteria for Hospital Infection” (42). While the core principles of this guideline are largely consistent with the CDC’s 1999 “Guidelines for the Prevention of Surgical Site Infection,” discrepancies remain in specific classifications and implementation details.
Of the 26 included studies, 10 studies (17, 19, 20, 23, 24, 32–34, 36, 37) adopted the CDC SSI diagnostic criteria; 4 studies (12, 27, 28, 35) mentioned SSI definitions without specifying the reference standard; 6 studies (14–16, 21, 22, 29) used the 2001 Chinese Ministry of Health criteria; 4 studies (13, 25, 26, 31) did not report SSI definitions; 1 study (30) referred to the 2018 Asia Pacific Infection Control guidelines, and 1 study (18) followed the Chinese Society of Gastrointestinal Cancer guidelines. These variations in SSI definitions and diagnostic standards may compromise the comparability and consistency of reported outcomes. Moreover, SSI can be further classified into superficial incisional, deep incisional, organ/space, and intra-abdominal infections, each with inherently different risk profiles and incidence patterns.
The incidence of SSIs may vary according to the surgical site, surgical approach, and surgical technique. Surgical technique (open vs. minimally invasive) is an independent determinant of infection risk; however, most existing predictive models have not adequately accounted for this variable, neither including it as a key predictor nor validating model performance in subgroups defined by surgical approach. It is also important to note that the timing of SSI assessment differs by procedure; for example, in patients with implants, the monitoring period may extend up to one year postoperatively. However, most studies included in this analysis did not explicitly report whether such procedures were included or adjust the assessment period accordingly, which may have led to misclassification of some infection events. In addition, most studies relied on retrospective data collection from hospital databases or electronic medical records. Differences in healthcare resources, economic conditions, cultural factors, and infection surveillance systems across countries and regions may further influence the accuracy of SSI monitoring and diagnosis. Some studies also lacked detailed descriptions of SSI definitions, diagnostic procedures, or surveillance methods, limiting the reproducibility of results and cross-study comparability.
Based on the factors described above, future studies should further standardize SSI outcome assessment, particularly by clearly specifying whether implant-related procedures are included and defining the corresponding monitoring period. Well-designed prospective studies are warranted to improve the accuracy and consistency of outcome measurement. On this basis, SSI risk prediction models should be further validated and refined for different populations and subgroups, including those with and without implants.
Predictors of SSI following abdominal surgery
4.3
This study summarized and analyzed the commonly used predictors in existing models, identifying operative duration, diabetes, body mass index (BMI), serum albumin level, ASA score, age, intraoperative blood loss, incision type, and open surgery as consistently high-frequency risk factors for postoperative SSI following abdominal surgery. These variables are generally easy to measure and readily accessible in clinical practice, enhancing their practical applicability.
Zeitlinger et al. (42) reported a positive correlation between operative time and SSI risk, with longer surgeries associated with higher infection rates, potentially due to factors such as surgical complexity, prolonged tissue exposure, and immunosuppression. Bhat et al. (43) suggested that surgeries exceeding 90 min constitute an independent risk factor for SSI, whereas Shen et al. (44) indicated that operative durations ≥180 min are associated with SSI. These findings highlight ongoing debate regarding specific time thresholds, warranting further investigation. Therefore, optimizing preoperative preparation, carefully planning surgical procedures, minimizing operating room personnel movement, and reducing operative time can mitigate risks of instrument contamination and tissue necrosis, ultimately decreasing SSI incidence. Additionally, organizing training sessions and competitions to enhance surgical knowledge and technical proficiency among staff can shorten operative duration and reduce patient harm.
Diabetes, as a chronic metabolic disorder, significantly increases postoperative infection risk due to immune dysfunction and microcirculatory impairment (45). Clinical practice should emphasize perioperative blood glucose monitoring for diabetic patients, with timely intervention for abnormal glucose levels to maintain relative glycemic stability. Particular attention should be paid to early postoperative fasting periods and parenteral nutrition, such as lipid emulsions, to manage glucose levels. Systematic patient education should also be implemented, addressing the psychological state of patients, informing them and their families about diabetes-related risks and dietary considerations, and encouraging active participation in self-management of blood glucose.
Elevated BMI is recognized as an independent risk factor for SSI, as excessive subcutaneous fat can compromise wound perfusion and contribute to tissue necrosis, thereby increasing infection risk (46). Bone et al. (47) found that a BMI > 25 kg/m^2^ increased SSI risk in patients undergoing hepatobiliary surgery, while another study reported BMI > 24 kg/m^2^ as an independent risk factor for SSI following cesarean section (48). BMI thresholds may vary according to surgical type. For elective surgery candidates, preoperative lifestyle interventions such as moderate exercise and dietary adjustments are recommended. Postoperatively, close monitoring of wound status, encouragement of early ambulation, and maintenance of stable glucose levels and body weight are essential.
Serum albumin is an important indicator of nutritional status, and malnutrition reduces resistance and immune function. Hypoalbuminemia can impair wound healing and adversely affect postoperative outcomes (49). Nutritional assessment upon admission, using laboratory measures such as serum albumin and validated assessment scales, is essential. When malnutrition is identified, communication with patients is necessary, alongside dietary adjustments and, when required, enteral or parenteral nutritional support.
ASA score reflects the patient’s preoperative comorbidities and overall physical condition, with higher scores typically associated with chronic conditions such as diabetes and cardiovascular disease, which may compromise immunity and increase infection risk (50). For patients with ASA class III or higher, surgical teams should actively coordinate with anesthesiologists to optimize patient status, ensure thorough preoperative preparation, adhere strictly to aseptic protocols, and reinforce postoperative care to reduce SSI risk.
Increasing age is associated with declining immune function and a higher prevalence of comorbidities, which correspondingly raises infection risk (51). For older patients with multiple comorbidities and poorly controlled conditions, detailed preoperative counseling and interventions to enhance immune function and manage underlying diseases are essential to minimize SSI occurrence. Wang et al. (52) reported that each 1 mL increase in intraoperative blood loss elevated SSI risk by 0.2%, potentially due to increased transfusion-related susceptibility and intraoperative contamination. Surgical teams should employ meticulous techniques to minimize tissue trauma, maintain effective hemostasis, and carefully consider transfusion indications to reduce SSI risk.
Studies have found that Class III incisions carry higher infection rates than Class II, highlighting the importance of proactively managing preoperative infection for higher-risk procedures (53). Surgical approach also significantly affects SSI risk; laparoscopic surgery, associated with smaller incisions and faster recovery, substantially reduces SSI risk compared to open procedures (54). Liu et al. (55) reported SSI rates of 20.86% for open surgery versus 8.61% for laparoscopic surgery. Preoperative patient assessment should guide selection of the most appropriate surgical approach, with laparoscopic procedures preferred when feasible to reduce trauma, postoperative infection risk, and promote recovery.
Gender was also identified as a factor influencing SSI, consistent with findings by Xu et al. (17). Most studies suggest that males have higher SSI risk due to greater abdominal fat, denser body hair, and higher rates of smoking and alcohol consumption, with male colorectal cancer patients exhibiting a 1.2-fold higher SSI risk than females (50, 56). Conversely, Pedroso-Fernandez et al. (57) indicated that females may prefer smaller incisions for cosmetic reasons, potentially increasing surgical difficulty and SSI risk. Some studies, however, report no significant association between gender and postoperative SSI (58). The relationship between gender and SSI remains controversial and warrants further large-scale, high-quality investigations.
Implications for future predictive model development
4.4
To enhance the scientific rigor and clinical utility of SSI risk prediction models following abdominal surgery, future studies should consider optimization in the following areas: (1) Diversifying predictor sources: incorporating readily available clinical and laboratory indicators; (2) Standardizing blinding and reporting: improving model transparency and reproducibility; (3) Rigorous model development design: adopting an EPV ≥ 20 criterion and a minimum sample size of 100 for validation; (4) Standardized variable handling: clearly defining cut-off values for continuous variables and reporting methods for handling missing data; (5) Optimizing variable selection methods: incorporating machine learning techniques such as LASSO regression to enhance model performance; (6) Strengthening validation strategies: employing multicenter study designs, expanding population diversity, and improving model generalizability and transportability; (7) Visual and practical implementation: translating models into web-based calculators, scoring systems, or integrating them into electronic medical record systems for convenient clinical use.
Additionally, adherence to reporting standards such as TRIPOD is recommended, encompassing model development, internal and external validation, and clinical evaluation, to systematically report the research process. This approach can enhance transparency and quality of reporting and promote the advancement of risk prediction model research.
Limitations
4.5
This study has several limitations. First, the reported SSI incidence varied considerably among the included studies, likely due to differences in definitions, classification, and diagnostic criteria, and the varying risk profiles of different SSI types may also affect comparability. Most studies were retrospective and conducted in different countries or regions with diverse infection surveillance and diagnostic practices, which may further compromise outcome ascertainment. Second, only studies published in English and Chinese were included, which could introduce publication bias and potentially omit high-quality studies in other languages. Furthermore, most predictive models were developed and validated in China, limiting their generalizability. Heterogeneity in sample size and predictor variables across studies precluded meta-analysis, restricting the precision of quantitative synthesis. In addition, most models primarily reported AUC, sensitivity, and specificity, while performance metrics more suitable for imbalanced outcomes, such as AUC-PR and F1 score, were seldom reported, and internal and external validations were generally lacking, indicating room for improvement in model development and evaluation. Future research should aim to standardize SSI definitions, adopt prospective study designs, and incorporate multidimensional performance metrics along with established model development and reporting guidelines, such as PROBAST and TRIPOD, to enhance model reliability and clinical applicability.
Conclusion
5
This study reviewed 28 predictive models for postoperative SSI in adult abdominal surgery. The systematic evaluation revealed that existing models still have notable limitations, including inconsistent SSI definitions and diagnostic criteria, high heterogeneity of data sources, and insufficient methodological quality and reporting standards. These shortcomings reduce the accuracy and generalizability of the models. Future research should focus on standardized SSI outcome assessment, rigorous methodological design, transparent reporting, and multicenter validation to enhance the clinical utility of predictive models, advance SSI risk prediction, and ultimately improve patient quality of life.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Mangram AJ Horan TC Pearson ML Silver LC Jarvis WR. Guideline for prevention of surgical site infection, 1999. Centers for Disease Control and Prevention (CDC) hospital infection control practices advisory committee. Am J Infect Control. (1999) 27:97–132. 10196487 · pubmed ↗
- 2Alkaaki A Al-Radi OO Khoja A Alnawawi A Alnawawi A Maghrabi A . Surgical site infection following abdominal surgery: a prospective cohort study. Can J Surg. (2019) 62:111–7. doi: 10.1503/cjs.004818, 30907567 PMC 6440888 · doi ↗ · pubmed ↗
- 3Marzoug OA Anees A Malik EM. Assessment of risk factors associated with surgical site infection following abdominal surgery: a systematic review. BMJ Surg Interv Health Technol. (2023) 5:e 000182. doi: 10.1136/bmjsit-2023-000182, 37529828 PMC 10387634 · doi ↗ · pubmed ↗
- 4KB RS JS Satyanesan J. Surgical site infections in gastrointestinal surgeries: estimation of prevalence, risk factors and bacteriological profile. Cureus. (2024) 16:e 62589. doi: 10.7759/cureus.62589, 39027770 PMC 11256213 · doi ↗ · pubmed ↗
- 5Savio L Simeone P Baron S Antonini F Bruder N Boussen S . Surgical site infection in severe trauma patients in intensive care: epidemiology and risk factors. Ann Intensive Care. (2024) 14:136. doi: 10.1186/s 13613-024-01370-7, 39218984 PMC 11366732 · doi ↗ · pubmed ↗
- 6Wang X Lan T Chang X Yang X. Distribution of pathogenic bacteria and analysis of infection risk factors in surgical site infections of colorectal cancer patients. J Pathog Biol. (2023) 18:1214–1217+1222. doi: 10.13350/j.cjpb.231020 · doi ↗
- 7Liu S Zhang N Sun J Liu Y Yang L Xu X. Interpretation of the implementation plan of special action "strengthen perioperative infection prevention and control and ensure surgical quality and safety". Chin J Nosocomiol. (2024) 34:3521–5. doi: 10.11816/cn.ni.2024-241185 · doi ↗
- 8Snell KIE Levis B Damen JAA Dhiman P Debray TPA Hooft L . Transparent reporting of multivariable prediction models for individual prognosis or diagnosis: checklist for systematic reviews and meta-analyses (TRIPOD-SRMA). BMJ. (2023) 381:e 073538. doi: 10.1136/bmj-2022-073538, 37137496 PMC 10155050 · doi ↗ · pubmed ↗
