# Developing and validating machine learning models to predict acetabular cup size in total hip arthroplasty

**Authors:** Felix C. Oettl, Aaron I. Weinblatt, Brian Chalmers, David Kolin, Alejandro Gonzalez Della Valle

PMC · DOI: 10.1016/j.jor.2025.07.021 · Journal of Orthopaedics · 2025-07-23

## TL;DR

This study uses machine learning to accurately predict the size of hip implants needed for surgery, which can help reduce costs and improve inventory management.

## Contribution

The novel use of machine learning models to predict acetabular cup size in THA with high accuracy based on preoperative metrics.

## Key findings

- Quantile regression forest outperformed EBM in predicting cup size within ±2 mm accuracy (82.85%).
- Sex, height, age, weight, surgical approach, and BMI were the most important predictors of cup size.
- Machine learning models can significantly improve inventory efficiency and cost savings in THA.

## Abstract

Adequate implant inventory management can improve efficiency, storage space, and result in cost savings in arthroplasty. This study investigates if the prediction of cup size in elective primary total hip arthroplasty (THA) cound be improved with the use of advanced machine learning.

Using the arthroplasty registry of a single institution, we identified 30,583 patients who underwent primary THA between 2016 and 2024. No data was missing or incomplete. A total of 9 parameters readily available preoperatively were included as potential predictor variables. The data corpus was partitioned into training (80 %) and hold-out test (20 %) samples. Two distinct machine learning models were trained on regression tasks. The models were technically evaluated utilizing Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). Spearman correlation coefficient was calculated to assess alignment with implanted cup. 95 % confidence intervals (95 % CI) were calculated via bootstrapping. Real world useability was assessed by the percent of correct predictions within ±2 mm from implanted cup.

The quantile regression forest outperformed the explainable boosted machine (EBM) in terms of MAE (1.69 [95 % CI 1.64, 1.73] vs 1.73 [1.69, 1.77]) and real-world usability, with an accuracy of 82.85 % within ±2 mm and 97.27 % within ±4 mm. The EBM outperformed the QRF by RMSE and Spearman Correlation coefficient, weighing outliers heavier. The most important factors in order were Sex, height, age, weight, surgical approach and BMI.

Machine learning models can predict implant sizing with very high accuracy based on a few metrics available preoperatively. This model can help decrease overall cost of THA by improving orthopaedic manufacturers' supply chains and hospitals’ inventory management.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12312115/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12312115/full.md

## References

11 references — full list in the complete paper: https://tomesphere.com/paper/PMC12312115/full.md

---
Source: https://tomesphere.com/paper/PMC12312115