# Automated AI fracture detection in initial presentation pediatric wrist X-rays: effects and benefits of adding follow-up examinations

**Authors:** Mario Scherkl, Nikolaus Stranger, Andreea Ciornei-Hoffman, Georg Singer, Tristan Till, Holger Till, Franko Hržić, Sebastian Tschauner

PMC · DOI: 10.1007/s11547-025-02153-1 · 2025-11-13

## TL;DR

This study explores how adding follow-up X-rays affects AI's ability to detect fractures in children's wrist X-rays, finding benefits mainly in object detection models.

## Contribution

The study is the first to investigate the impact of follow-up X-rays on AI performance for pediatric wrist fracture detection.

## Key findings

- EfficientNet models showed no significant improvement with follow-up X-rays in classification performance.
- YOLOv8 models showed improved object detection metrics (AP50 and F1 score) when follow-up X-rays were included.
- Including both cast and non-cast follow-up X-rays led to the most significant improvements in object detection.

## Abstract

Artificial Intelligence (AI) in radiology has shown promise in detecting fractures on initial X-rays. However, the role of follow-up examinations in enhancing AI performance remains unexplored. This study evaluates the impact of including follow-up X-rays on the performance of neural networks in detecting pediatric wrist fractures.

Using the publicly available GRAZPEDWRI-DX dataset of 20,327 pediatric wrist X-rays, we created four training datasets: initial X-rays alone and combinations with follow-up X-rays (with and without casts). Two neural networks, EfficientNet (image classification) and YOLOv8 (object detection), were trained and evaluated using precision, recall, F1 score, and AP metrics. The dataset was divided into training, validation, and test sets, with 500 initial X-rays separated and reserved for testing.

EfficientNet models showed no statistically significant improvements in classification performance with the inclusion of follow-up X-rays. In contrast, YOLOv8 demonstrated improved object detection metrics, particularly AP50 (p = 0.003) and F1 score (p = 0.009), when follow-up X-rays were included. The improvement was most evident when both cast and non-cast follow-ups were incorporated.

Adding follow-up X-rays did not enhance classification performance but improved fracture localization in object detection tasks. These findings suggest that including follow-up data shows no relevant improvement in the detection rate of fractures but can enhance AI applications for pediatric wrist fracture detection, particularly for object detection models.

## Full-text entities

- **Diseases:** wrist fracture (MESH:D000092503), fracture (MESH:D050723), AI fracture (MESH:C538142)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12982319/full.md

---
Source: https://tomesphere.com/paper/PMC12982319