Accuracy of AI-Assisted Methods in Preoperative Templating Compared to the Standard X-Ray-Based Surgeon-Performed Templating in Total Hip Replacement Surgery: A Systematic Review and Meta-Analysis
Osamah Al-Zubaidi, Aiman Alzuabidi, Ban Alkurdi, Dina Alkurdi, Nayef Aslam-Pervez

TL;DR
AI-assisted preoperative templating for hip replacement surgery is more accurate and faster than traditional X-ray-based methods, but more global research is needed.
Contribution
Demonstrates AI-assisted templating significantly improves implant size prediction and reduces operative time compared to standard methods.
Findings
AI-assisted methods had 4.163 times higher odds of exact acetabular cup size prediction compared to manual methods.
AI-assisted templating reduced operative time by 9.2 minutes compared to surgeon-performed templating.
Results suggest AI-assisted templating is more accurate and efficient, though generalizability is limited to Chinese studies.
Abstract
Accurate preoperative templating in total hip arthroplasty (THA) improves the procedure’s precision, shortens its duration, and reduces complications such as instability and dislocations, prosthesis loosening, and loss of bone stock, among others. The current standard method uses X-ray images to manually determine implant size; however, the emergence of artificial intelligence (AI) is gaining increasing interest in its use for preoperative templating. This systematic review and meta-analysis aims to assess the difference in accuracy between the standard and AI methods for preoperative templating in regards to implant size. A systematic search of electronic databases identified 36 papers related to AI in preoperative planning in THA. Of these, 9 studies met the inclusion criteria and cumulatively yielded 1049 patient for comparison. Statistical analysis was performed using…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5| Acetabulum Size Match /n | Stem Size Match /n | Operation Time (Minutes) | |||||
| Author, year | Study design | AI | Man | AI | Man | AI | Man |
| Huo et al, 2021 [ | Randomised controlled trial | 44/59 | 24/59 | 42/59 | 29/59 | NR | NR |
| Ding et al, 2021 [ | Crossover retrospective | 297/316 | 206/316 | 277/316 | 186/316 | NR | NR |
| Chen et al, 2022 [ | Randomised controlled trial | 40/60 | 20/60 | 33/60 | 19/60 | 106.83 ± 18.20 | 109.58 ± 21.98 |
| Wu Zhau, 2023 [ | None-randomised retrospective | 51/95 | 25/66 | 61/95 | 29/66 | 78.5±18.5 | 83.6±18.7 |
| Wu Yang et al, 2023 [ | None-randomised retrospective | 19/34 | 8/27 | 23/34 | 11/27 | 78.8±16.9 | 83.4±17.5 |
| Xie et al, 2024 [ | Crossover retrospective | 138/164 | 105/164 | NR | NR | NR | NR |
| Anwar et al, 2024 [ | Crossover retrospective | 78/117 | 36/117 | 77/117 | 44/117 | NR | NR |
| Yang et al, 2024 [ | Randomised retrospective | 169/220 | 89/220 | 175/220 | 101/220 | 43.22 ± 12.65 | 68.36 ± 14.45 |
| Zheng et al, 2025 [ | Randomised controlled trial | 22/29 | 12/27 | 24/29 | 16/27 | 95±13.74 | 100±26.51 |
| Author, year | Selection | Comparability | Exposure |
| Wu, Zhau, et al 2023 [ | **** | ** | *** |
| Wu, Yang, et al, 2023 [ | **** | ** | *** |
| Author, year | D1: Risk of bias arising from the randomisation process | S: Risk of bias arising from period and carryover effects (crossover studies) | D2: Risk of bias due to deviations from the intended interventions (effect of assignment to intervention) | D3: Risk of bias due to missing outcome data | D4: Risk of bias in measurement of the outcome | D5: Risk of bias in the selection of the reported result | Overall |
| Xie et al, 2024 [ | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Zheng et al, 2025 [ | 1 | N/A | 1 | 1 | 1 | 1 | 1 |
| Anwar et al, 2024 [ | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Yang et al, 2024 [24] | 1 | N/A | 1 | 1 | 1 | 1 | 1 |
| Huo et al, 2021 [ | 1 | N/A | 1 | 1 | 1 | 1 | 1 |
| Ding et al, 2021 [ | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Chen et al, 2022 [ | 1 | N/A | 1 | 1 | 1 | 1 | 1 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOrthopaedic implants and arthroplasty · Total Knee Arthroplasty Outcomes · Advanced X-ray and CT Imaging
Introduction and background
Hip preoperative templating is the process of estimating implant size and positioning in total hip arthroplasty (THA) using preparatory imaging [1]. Accurate templating enhances surgical precision, shortens operative time, and reduces complications such as instability and dislocation, prosthesis loosening and bone stock loss [2,3].
Traditionally, templating was performed manually by overlaying transparencies of magnified implants on radiographs to determine the optimal implant size [4]. The introduction of Picture Archiving and Communication Systems (PACS) has since enabled digital templating, which uses external calibration objects to calculate radiographic magnification [5]. In digital radiography, computer software adjusts templates for magnification [6], after which the surgeon identifies anatomical landmarks, evaluates image quality, defines mechanical references, and positions acetabular and femoral components to achieve a final plan [1]. While this 2D templating method remains the clinical standard [7], it is limited by the intrinsic complexity of the hip as a 3D joint [7]. Errors may arise from magnification, patient positioning, or biomechanical measurements such as femoral offset, which can be affected by hip flexion or rotation [7].
CT-based templating provides a more detailed three-dimensional assessment but involves a labour-intensive workflow, including image segmentation, pelvic correction, deformity recognition, and postoperative simulation. As a result, it is time-consuming and requires collaboration between engineers, programmers, and surgeons, limiting its routine use [8].
Artificial intelligence (AI) has recently emerged as a potential solution, offering automated data organisation, implant sizing, and surgical guidance [9]. AI-HIP software (Beijing Changmugu Medical Technology Co., Ltd., Beijing, China) is one such 3D CT-based system that applies deep learning algorithms to segment images and support preoperative planning. Although increasingly adopted, its role in orthopaedics has not yet been fully validated [10].
This systematic review and meta-analysis, therefore, aims to compare the accuracy of AI-assisted 3D preoperative templating with the conventional 2D approach.
Review
Methods
This systematic review and meta-analysis were performed as per the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement standards (Figure 1) [11].
PRISMA flow chart followed in this meta-analysisPRISMA: Preferred Reporting Items for Systematic Reviews and Meta-AnalysesSource: [11]
Eligibility Criteria
All studies, including randomised and non-randomised trials and observational studies, comparing the accuracy of preoperative templating using an AI-assisted method to the preoperative templating using a standard X-ray-based, human-conducted method in primary and revision total hip replacement operations. Only studies in English were included. There was no restriction on age, comorbidities, indication for total hip replacement, place of study, or size of study.
Exclusion Criteria
Duplicates, abstracts, non-comparative studies, and studies that were not in English and any study that failed to report on the authors’ defined primary outcome measure, which was the exact match in size of the preoperative template with the actual implant size used intraoperatively, were excluded.
Outcome Measures
The primary outcome was the exact match in size of the preoperative template with the actual implant size used intraoperatively. Operative time was the secondary outcome.
Literature Search Strategy
Two authors independently searched the published electronic literature in Google Scholar, PubMed, and MEDLINE. The last search was conducted on 27/5/2025 at 15:39. The search terms included “AI”, "artificial intelligence", ”THR”, "hip replacement", "hip arthroplasty", “THA”, “templating”, “template”. The terms were collated with adjuncts, including ‘and’ as well as ‘or’. The authors also searched the bibliographical lists of relevant articles to optimise article retrieval.
Selection of Studies
Two authors independently evaluated the titles and abstracts of all studies, and those meeting the eligibility criteria were subsequently assessed for their full texts. Any discrepancy in selection was resolved by consultation with a third author.
Data Extraction and Management
A data extraction spreadsheet based on Cochrane’s data collection form for interventional reviews was drafted on Microsoft Excel (Microsoft Corporation, Redmond, WA, US). The information collected included: study year, study design, total number of patients, number of patients who had an exact match using AI and standard templating for acetabular and femoral components, and operative time. The extraction form was pilot tested on randomly selected articles initially before undergoing adjustments. Two authors independently extracted and entered the data, with any ambiguity being resolved by discussion. The included papers and extracted data are detailed in Table 1.
Data Synthesis
OpenMetaAnalysis software (https://openmetaanalysis.github.io/) was utilised to conduct data synthesis with a random effects model being adapted and forest plots reported at 95% confidence intervals (CI). For dichotomous outcome data, the odds ratio was used. This enabled the assessment of the AI-assisted templating method against the standard X-ray-based, surgeon-conducted templating method. For continuous data, the mean difference was utilised to assess the difference in operative time between the two groups.
Methodological Quality and Risk of Bias
The Newcastle-Ottawa Scale (NOS) was used to assess the quality of all non-randomised retrospective studies (Table 2) [18]. It uses a star system with three different domains: selection of study groups, comparability of groups, and the exposure/outcome of interest. It allows for the allocation of a maximum of nine stars for each study. The Cochrane risk of bias tool (The Cochrane Collaboration, London, England, UK) was reserved for the assessment of randomised and crossover trials (Table 3).
Assessment of Heterogeneity
Heterogeneity for all studies was assessed by a dual method of using the Cochran Q test (v2) as well as calculating the I² score. This was interpreted along the following scale: 0-25% low heterogeneity; 25%-75% moderate heterogeneity; and 75%-100% high heterogeneity.
Results
The online literature search retrieved 36 articles. These were screened independently by two authors who excluded duplicates, abstracts, non-comparative studies, studies that were not in English, and any study that failed to report on the author-defined primary outcome measure, which was the exact match in size of the preoperative template with the actual implant size used intraoperatively. A total of nine studies met the inclusion criteria for quantitative synthesis and cumulatively yielded 1049 patients for comparison [8-10,12-17].
AI-Assisted Templating Accuracy
Acetabular component: Nine studies overall reported on the exact match in size of the preoperative template with the actual implant size used intraoperatively for the acetabular cup (Figure 2) [8-10,12-17]. The data showed a heterogeneity of I2 (15%) and a P-value of 0.055. The overall accuracy of using the AI-assisted method was 4.163 times higher than the standard manual method in predicting the acetabular cup component (OR=4.163, P<0.001).
Forest plot showing the odds ratio of the acetabular size match between the preoperative planned size and the actual size used intraoperativelyThe use of AI in preoperative planning was superior to the standard manual method.OR= 4.163, P<0.001, heterogeneity: I2 = 15%, P= 0.055Source: [8-10,12-17]
Femoral component: Eight studies reported on the exact match in size of the preoperative template with the actual implant size used intraoperatively for the femoral stem (Figure 3) [8,10,12-17]. The data showed a heterogeneity of I2 (0%) and a P-value of 0.395. The overall accuracy of using the AI-assisted method was 3.672 times higher than the standard manual method in predicting the femoral stem component (OR=3.672, P<0.001).
Forest plot showing the odds ratio of the femoral stem size match between the preoperative planned size and the actual size used intraoperativelyThe use of AI in preoperative planning was superior to the standard manual method.OR=3.672, P<0.001, heterogeneity: I2 = 0%, and P=0.395Source: [8,10,12–17]
Operative Time
Five studies reported on the operative time using the AI-assisted and surgeon-performed methods (Figure 4) [8,13,14,16,17]. The initial meta-analysis of data showed a significant heterogeneity of 88.48% and a P-value of < 0.001. Operation time following the use of the AI-assisted method was 9.2 minutes less than operations following the surgeon-performed method (MD=-9.2, P=0.027). However, as evident from the forest plot in Figure 4, the Yang et al. paper was identified as an outlier and the source of heterogeneity [16]. After excluding the source of heterogeneity, i.e., Yang et al. [16], the analysis showed that the operation time following the use of the AI-assisted method was 4.35 minutes less than operations following the surgeon-performed method (MD=-4.35, P=0.025) with heterogeneity of 0% and P-value 0.966 (Figure 5).
Forest plot showing the mean difference in operative time between operations in which AI preoperative planning was used and operations in which the standard manual method was usedThe use of AI in preoperative planning resulted in a shorter operative time compared to the standard manual method.MD=-9.2, P=0.027 heterogeneity: I2 =88.48%, P value < 0.001Note that Yang et al. (2024) is an outlier and is the source of heterogeneity.Source: [8,13,14,16,17]
Forest plot showing the mean difference in operative time between operations in which AI preoperative planning was used and operations in which the standard manual method was used after removal of heterogeneityThe use of AI in preoperative planning resulted in a shorter operative time, as compared to the standard manual method.MD=-4.35, P=0.025 heterogeneity: I2=0%, P-value < 0.966Source: [8,13,14,17]
Limitations
In this meta-analysis, only three studies were prospective randomised controlled trials. Thus, higher-level evidence remains necessary to validate the role of AI in preoperative templating. Additionally, all included studies were conducted in China, limiting the generalisability to other healthcare systems and populations.
Discussion
AI is a rapidly developing technology, and its use in medical practice has been gaining increasing interest in recent years. However, its role and validity remain to be fully established. Our meta-analysis aimed to assess the superiority of AI-assisted preoperative templating over the standard X-ray-based surgeon-conducted method and to identify existing gaps in the literature. This review builds upon the preceding systematic review performed by Mozafari et al. [19] by including three additional studies and performing a meta-analysis.
In total, nine studies [8-10,12-17] met the inclusion criteria. The pooled odds ratio assessment demonstrated a significant increase in the accuracy of preoperative templating in total hip replacement (THR) when AI-assisted methods were employed. Moreover, the mean difference analysis showed reduced operative time with AI use. The superiority of AI accuracy is likely attributable to the advantage of 3D CT imaging over conventional 2D radiographs [7]. Notably, Huo et al. reported no significant difference in accuracy between AI-assisted and surgeon-performed templating when both utilised 3D models, though AI templating remained significantly quicker by 28.1 minutes [12]. Similarly, a shorter operative time may result from enhanced accuracy, which reduces intraoperative adjustments [2]. Nevertheless, our operative time analysis showed high heterogeneity, likely reflecting differences in surgical approach, as Yang et al. [16] was the only paper in which the direct anterior approach to the hip was used, whilst the posterolateral approach was used in all other papers [8,13,14,17]. To control this, another meta-analysis excluding the source of heterogeneity was performed, and both results were reported.
Although risk-of-bias assessment revealed overall good quality using the NOS and the Cochrane risk-of-bias tool, only three studies were prospective randomised controlled trials. Thus, higher-level evidence remains necessary to validate the role of AI in preoperative templating. Additionally, all included studies were conducted in China, limiting the generalisability to other healthcare systems and populations.
Beyond accuracy and operative efficiency, the clinical implications of AI-assisted templating merit attention. Greater preoperative precision could improve implant positioning, lower the risk of complications, and potentially reduce revision rates [2,3], thereby enhancing long-term patient outcomes. Reduced operative time may also translate into lower intraoperative infection risk and improved operating theatre utilisation. However, the adoption of AI and 3D CT-based methods carries economic considerations. While operative time savings may reduce costs, the increased expense of advanced imaging and AI software may offset these benefits, warranting future cost-effectiveness analyses.
The integration of AI into surgical workflows also raises practical and ethical questions. Surgeons may require training to effectively interpret AI-generated templates, but AI could simultaneously reduce inter-operator variability, particularly among less experienced surgeons. Ethical and regulatory concerns, including data security, algorithm transparency, and validation across diverse patient groups, must be addressed before widespread implementation.
The authors, therefore, recommend the consideration of the use of AI-assisted preoperative templating for THA surgeries and the adoption of large, multicentre, prospective, double-blinded randomised control studies to fill the gap and gain a high level of evidence to support the introduction of the use of AI-assisted preoperative templating in total hip replacement surgeries in standard clinical practice. We also recommend further studies to assess the cost-effectiveness and the long-term outcomes such as patient satisfaction, implant survival, and rate of revision.
Conclusions
AI-assisted preoperative templating showed a significant increase in accuracy and resulted in shorter operative duration when compared to the standard X-ray-based, surgeon-performed preoperative templating in total hip replacement, although a high level of evidence is still lacking. The adoption of large, multicentre, prospective, double-blinded, randomised control studies is recommended to fill the gap and gain a high level of evidence to support the introduction of the use of AI-assisted preoperative templating in total hip replacement surgeries in standard clinical practice.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Primary hip arthroplasty templating on standard radiographs. A stepwise approach Acta Orthop Belg Scheerlinck T 432442762010 https://upload.orthobullets.com/journalclub/free_pdf/20973347_20973347.pdf 20973347 · pubmed ↗
- 2A new digital preoperative planning method for total hip arthroplasties Clin Orthop Relat Res Crooijmans HJ Laumen AM van Pul C van Mourik JB 90991646720091878136710.1007/s 11999-008-0486-y PMC 2650039 · doi ↗ · pubmed ↗
- 3King Mark's dual-marker versus a conventional single-marker templating system: is there a difference in accuracy of predicting final implant sizes and leg lengths?Eur J Orthop Surg Traumatol Al-Ashqar M Aslam N Azhar MS Grayston J Hahnel J 1671753320233484299010.1007/s 00590-021-03174-y · doi ↗ · pubmed ↗
- 4Digital versus conventional templating techniques in preoperative planning for total hip arthroplasty Can J Surg Kosashvili Y Shasha N Olschewski E Safir O White L Gross A Backstein D 611522009 https://pmc.ncbi.nlm.nih.gov/articles/PMC 2637640/19234645 PMC 2637640 · pubmed ↗
- 5Preoperative templating for total hip arthroplasty: a method for calibrating digital radiographs using patient demographics and anthropometric measurements Cureus Attwood J Banks P Sidhom A Pandit H Sidhom S van Duren B 015202310.7759/cureus.47668 PMC 1066794438022321 · doi ↗ · pubmed ↗
- 6Templating hip arthroplasty Open Access Maced J Med Sci Alnahhal A Aslam-Pervez N Sheikh HQ 672685720193089493310.3889/oamjms.2019.088PMC 6420946 · doi ↗ · pubmed ↗
- 7Comparison of the accuracy of 2D and 3D templating methods for planning primary total hip replacement: a systematic review and meta-analysis EFORT Open Rev Bishi H Smith JB Asopa V Field RE Wang C Sochart DH 7083720223507641310.1530/EOR-21-0060 PMC 8788155 · doi ↗ · pubmed ↗
- 8Development and validation of an artificial intelligence preoperative planning system for total hip arthroplasty Front Med (Lausanne) Chen X Liu X Wang Y 841202920223539188610.3389/fmed.2022.841202 PMC 8981237 · doi ↗ · pubmed ↗
