# Automated classification of shoulder radiology focusing on cuff tear arthropathy and glenoid erosion using AI

**Authors:** Michael Axenhus, Martin Magnéli, Jacob Gislén, Johan Fagrell, Petter Ling, Yilmaz Demir, Erica Domeij Arverud, Kristofer Hallberg, Björn Salomonsson, Max Gordon

PMC · DOI: 10.1186/s12891-026-09603-5 · 2026-02-25

## TL;DR

This study uses AI to classify shoulder disorders like cuff tear arthropathy and glenoid erosion from X-rays, showing strong performance in advanced disease stages.

## Contribution

A novel AI model is proposed for classifying CTA and glenoid erosion on plain radiographs using Hamada and Favard systems.

## Key findings

- The AI model achieved AUCs of 0.95 for Hamada grades 3 and 4.
- Performance for Hamada grades 0–2 and glenoid erosion ranged from AUCs of 0.81 to 0.91.
- The model showed robust accuracy with a heterogeneous dataset, suggesting potential for real-world use.

## Abstract

Recent advancements in the field artificial intelligence (AI), particularly in the architecture of convolutional neural network (CNN) architecture, have revolutionized medical imaging by enabling accurate image recognition. However, the application of AI in identifying degenerative musculoskeletal disorders, specifically on plain radiographs, is still poorly explored. The aim of this study is to classify cuff tear arthropathy (CTA) and glenoid erosion using AI on plain shoulder radiographs, using the Hamada and Favard classification systems.

We used a publicly available CNN trained for image recognition and trained it using a diverse dataset of 6733 shoulder and clavicle X-ray images covering various clinical conditions. The performance of the network was evaluated in detail on a validation set of 561 images. Metrics such as sensitivity, specificity, Youden’s index, and Area Under Curve (AUC) in the receiver operating characteristics curve analysis, were used for evaluation. AUC was the primary measure of accuracy.

The network showed exceptional performance in identifying Hamada grades 3 and 4, achieving AUCs of 0.95, 95% CI [0.91–0.98] for both categories. While performance was slightly lower for Hamada grades 0–2 and glenoid erosion, with AUCs ranging from 0.81 to 0.91, it still demonstrated considerable accuracy. Similar results were obtained for Favard although we could not validate the network for more advanced stages due to lack of data.

Our study demonstrates the network’s robust capability to identify CTA on plain radiographs, comparable to earlier studies focused on osteoarthritis. Notably, the network excelled in later disease stages characterized by pronounced pathology. The ability to achieve such performance with a heterogeneous dataset bodes well for the real-world implementation of AI technology. However, it is crucial to acknowledge the potential influence of using a validation set instead of a dedicated test set, warranting further investigation.

## Full-text entities

- **Diseases:** knee OA (MESH:D020370), Erosion of the glenoid (MESH:D014077), shoulder pain (MESH:D020069), AVN (MESH:D010020), CTA (MESH:D000070656), tendon calcification (MESH:D052256), scapular fractures (MESH:C566638), Fractures (MESH:D050723), fractures of the proximal humerus (MESH:D006810), glenohumeral arthritis (MESH:D001168), collapse (MESH:D001261), degenerative conditions (MESH:D019636), tears (MESH:D012167), OA (MESH:D010003), wear (MESH:D057085), humeral head collapse (MESH:D012784), clavicle fracture (MESH:C562548), Damage to the rotator cuff (MESH:D000070636), deformities to the glenohumeral joint (MESH:D012783), diaphysis fractures (MESH:C537569), sclerosis (MESH:D012598), degenerative musculoskeletal disorders (MESH:D009140), degenerative disorders of the knee and hip (MESH:C564185)
- **Chemicals:** GPU (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12961769/full.md

---
Source: https://tomesphere.com/paper/PMC12961769