# Effect of TRIPOD+AI Guidelines on the Reporting Quality of Artificial Intelligence Prediction Models in Orthopaedic Surgery: An 18-Month Bibliometric Study

**Authors:** Shashwat Singh

PMC · DOI: 10.7759/cureus.97176 · 2025-11-18

## TL;DR

This study found that the TRIPOD+AI guidelines did not significantly improve the quality of reporting in AI prediction model abstracts in orthopaedic surgery over 18 months.

## Contribution

The study evaluates the real-world impact of the TRIPOD+AI guidelines on reporting quality in orthopaedic AI research.

## Key findings

- Performance measure reporting remained high but no abstract met all four TRIPOD+AI criteria.
- Outcome event reporting increased slightly, while participant number reporting declined.
- Confidence interval and study registration reporting remained poor despite the guidelines.

## Abstract

The TRIPOD+AI (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis plus Artificial Intelligence extension), published in April 2024, provides guidance for transparent reporting of artificial intelligence (AI)-based prediction models. It provides specific guidance for items to include in abstracts in this field. This study evaluated whether reporting quality in orthopaedic AI prediction model abstracts improved following the publication of TRIPOD+AI guidelines. We searched PubMed for English-language studies evaluating AI prediction models in orthopaedics across two 18-month periods: pre-TRIPOD+AI (October 2022 to April 2024) and post-TRIPOD+AI (April 2024 to October 2025). Abstract compliance was assessed against four TRIPOD+AI criteria: performance measure specification (Item 8), sample size and outcome events (Item 9), performance estimates with confidence intervals (Item 11), and study registration (Item 13). Reporting frequencies were compared using chi-squared tests. Among 522 eligible studies (pre-TRIPOD+AI=214, post-TRIPOD+AI=308), reporting of performance measures remained high (96.7% vs 98.4%, p=0.35). Full compliance with Item 9 showed a non-significant increase (32.7% to 39.9%, p=0.11). Reporting of outcome events increased from 36.0% to 44.5% (p=0.06), while participant number reporting declined from 82.2% to 75.0% (p=0.06). Confidence interval reporting remained low (18.7% vs 16.6%, p=0.61), and study registration was nearly absent (0.5% vs 1.0%, p=0.89). No abstract met all four criteria. Eighteen months after its publication, TRIPOD+AI has not measurably improved reporting quality in orthopaedic AI abstracts. Confidence interval reporting and study registration remain particularly deficient. These findings suggest that guideline dissemination alone may be insufficient and that active journal-level implementation strategies may be needed to improve reporting standards.

## Full-text entities

- **Chemicals:** Item 9 (-)

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12627258/full.md

---
Source: https://tomesphere.com/paper/PMC12627258