# Machine Learning for Predicting Venous Thromboembolism After Joint Arthroplasty: Systematic Review of Clinical Applicability and Model Performance

**Authors:** Junwei Ma, Huifeng Tang, Yunshan Zhang, Xuemei Yi, Tangsheng Zhong, Xinyun Li, Gang Wang

PMC · DOI: 10.2196/79886 · JMIR Medical Informatics · 2026-02-12

## TL;DR

This review evaluates machine learning models for predicting blood clots after joint surgery, finding inconsistent performance and high bias risks.

## Contribution

The study systematically reviews ML models for venous thromboembolism prediction, highlighting methodological flaws and clinical applicability issues.

## Key findings

- 34 models from 9 studies showed significant heterogeneity and high risk of bias.
- Some models had high area under the curve values but lacked external validation.
- Models on large external datasets showed more conservative performance.

## Abstract

There is increasing research on machine learning in predicting venous thromboembolism after joint arthroplasty, but the quality and clinical applicability of these models remain uncertain.

This systematic review aims to evaluate the predictive performance and methodological quality of machine learning models for venous thromboembolism risk after joint replacement surgery.

Web of Science, Embase, Scopus, CNKI, Wanfang, Vipro, and PubMed were searched until December 15, 2024. The risk of bias and applicability were evaluated using the PROBAST (Prediction Model Risk of Bias Assessment Tool) checklist. A qualitative comprehensive analysis was conducted to extract and describe the data related to the model’s characteristics and performance.

This review encompassed 34 prediction models from 9 studies. The most frequently used machine learning models were extreme gradient boosting and logistic regression. The results showed that all studies had significant heterogeneity and high risk of bias. Although some models reported nearly flawless area under the curve (>0.9), they lacked external validation and may have overfitted. The models tested on large external datasets demonstrated more conservative performance.

The predictive performance of machine learning models varied greatly. Although the reported area under the curve values indicated that some models have good discriminative ability, this performance varied greatly and was inconsistent among the included studies. These models have a high risk of bias, and it is necessary to take this into account when they are used in clinical practice. Future studies should adopt a prospective study design, ensure appropriate data handling, and use external validation to improve model robustness and applicability.

## Linked entities

- **Diseases:** venous thromboembolism (MONDO:0005399)

## Full-text entities

- **Diseases:** Venous Thromboembolism (MESH:D054556)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12900511/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12900511/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/PMC12900511/full.md

---
Source: https://tomesphere.com/paper/PMC12900511