# Artificial intelligence in orthopedic trauma surgery: a scoping review of current applications and research gaps

**Authors:** Lennard M. Wurm, Wolfgang Ertel, Dominik Laue

PMC · DOI: 10.1007/s00402-026-06276-6 · 2026-03-28

## TL;DR

This review explores how artificial intelligence is being used in orthopedic trauma surgery, finding that while AI shows promise, most studies are at an early stage and lack real-world validation.

## Contribution

The study provides a comprehensive scoping review of AI applications in orthopedic trauma surgery, highlighting research gaps and the need for better validation and implementation strategies.

## Key findings

- Most AI studies in orthopedic trauma are retrospective and use deep learning for fracture detection.
- Only a small fraction of AI models have been externally validated or implemented in clinical practice.
- Diagnostic AI models show high accuracy, but prognostic models have moderate-to-high performance.

## Abstract

Artificial intelligence (AI) is rapidly transforming clinical decision-making, yet its role in orthopedic trauma surgery remains fragmented and unevenly validated in clinical practise.

We conducted a PRISMA-SCR–compliant scoping review using a systematic search of the Semantic Scholar, OpenAlex and PubMed corpus via Elicit (497 records). Studies were eligible if they applied AI or machine-learning methods to traumatic orthopedic conditions, included ≥ 10 human subjects, reported quantitative performance metrics, and represented original research. After title/abstract and full-text screening, 146 studies were included. Data on study characteristics, AI methodology, clinical application, validation strategy, performance metrics, explainability and translational maturity were synthesized descriptively.

Research output increased sharply after 2017, with 52% of all studies published since 2022. Most studies were retrospective (≈ 99%). Deep learning dominated the field (61%), particularly for fracture detection and classification, while classical machine-learning models were mainly used for outcome prediction. Internal validation was reported in 85% of studies, whereas only 15% clearly performed external or multicenter validation; true prospective clinical testing was rare (1.4%), and only a small subset of models had been implemented in practice (3.4%). Diagnostic models frequently achieved very high technical accuracy (AUC 0.90–1.00 in constrained tasks), while prognostic models showed moderate-to-high performance (AUC 0.75–0.95). Explainability was underreported, only 24% used any form of saliency mapping, Grad-CAM or feature importance analysis.

AI in orthopedic trauma surgery demonstrates strong technical feasibility but remains overwhelmingly at the proof-of-concept stage. The field is characterized by limited external validation, minimal prospective evidence, scarce explainability, and insufficient workflow integration, factors that collectively hinder clinical translation. To bridge the gap from laboratory performance to real-world impact, future research must emphasize multicenter datasets, rigorous external and prospective validation, explainable AI, and user-centered implementation studies. AI has the potential to augment, rather than replace, orthopedic trauma care, but its safe and effective adoption requires substantial methodological maturation.

## Full-text entities

- **Diseases:** Wrist Fracture (MESH:D000092503), Multiple Fractures (MESH:D000069076), Elbow Fractures (MESH:D000092482), Pelvic Fracture (MESH:D034161), Hand Fracture (MESH:D006230), degenerative joints (MESH:D019636), aortic injuries (MESH:D001018), Fracture (MESH:D050723), Humerus Fractures (MESH:D006810), venous thromboembolism (MESH:D054556), Physical Trauma (MESH:D000070617), ARTIFICIAL (MESH:D060437), femoral shaft fractures (MESH:D005264), Scapular Fracture (MESH:C566638), proximal femur fracture (MESH:D000092526), Ankle Fractures (MESH:D064386), knee (MESH:D007718), calcaneus fractures (MESH:D000070558), XAI (MESH:C538243), Osteoporosis (MESH:D010024), wound infection (MESH:D014946), Burn Trauma (MESH:D002056), femoral neck fracture (MESH:D005265), neurological deficits (MESH:D009461), critically ill (MESH:D016638), hematoma (MESH:D006406), proximal humeral fractures (MESH:D012784), Hip Fracture (MESH:D006620), Mortality (MESH:D003643), Bone Injury (MESH:D001847), Orbital Blowout Fracture (MESH:D009917), burst fractures (MESH:C562695), Femur Fracture (MESH:D000092524), metacarpal and phalangeal fractures (MESH:C535863), LEARNING MODELS (MESH:D007859), Fractures of Pelvis and Acetabulum (MESH:D010386), comminution (MESH:D018460), Arterial Injury (MESH:D057772), Postoperative Complications (MESH:D011183), distal humerus fracture (MESH:D000092483), Tibial Plateau Fractures (MESH:D000092463), Fibrocartilage Complex Injuries (MESH:D048090), Injury (MESH:D014947), Cervical spine fracture (MESH:D002575), Colles' fracture (MESH:D003100), posterior ligamentous complex injury (MESH:D017887), Infection (MESH:D007239), elbow effusions (MESH:D000092464), postoperative (MESH:D019106), Tibial Shaft Fractures (MESH:D013978), blunt (MESH:D014949), Major Trauma (MESH:D004830), dislocations (MESH:D004204), bladder rupture (MESH:D012421), DISTAL (MESH:D049310), AI (MESH:C538142), fractured fragments (MESH:D012892), Musculoskeletal Disorders (MESH:D009140)
- **Chemicals:** YOLOX (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13033022/full.md

---
Source: https://tomesphere.com/paper/PMC13033022