# Inter-observer variability in scoring of white matter injury on brain magnetic resonance imaging in moderate-to-late preterm infants

**Authors:** Kyle Grabowski, Liam Olsen, Greg Gamble, David Perry, Jane Harding

PMC · DOI: 10.1007/s00247-025-06297-0 · Pediatric Radiology · 2025-06-20

## TL;DR

This study shows that a brain MRI scoring system for white matter injury is reliable in moderate-to-late preterm infants, but injury detection decreases as infants approach term-equivalent age.

## Contribution

Demonstrates high inter-observer reliability of a white matter injury scoring system in moderate-late preterm infants and reveals changes in injury detection over time.

## Key findings

- Inter-observer agreement for white matter injury presence was near perfect (kappa 0.88 and 0.81).
- Incidence of white matter injury decreased from 30% to 22% between first and second MRI scans.
- Severity of injury also decreased between scans, with substantial agreement at the second scan (kappa 0.80).

## Abstract

Punctate white matter injury on brain magnetic resonance imaging (MRI) is described in very preterm infants (< 32 weeks’ gestation) and is predictive of poorer developmental outcomes. The reliability of scoring and the incidence and evolution of white matter injury in moderate-late preterm infants is unknown.

To assess inter-observer variability in white matter injury using a published scoring system (UCSF system), and to describe changes over time in moderate-late preterm infants.

Infants born between 32 + 0 and 36 + 6 weeks’ gestation in the Auckland region underwent MRI scans as soon as clinically feasible after birth and again at term-equivalent age. De-identified scans were scored independently by two observers. White matter injury was graded as minimal (< 3 lesions measuring < 2 mm), moderate (> 3 lesions or lesions > 2 mm), or severe (> 5% hemispheric involvement). Scores were compared between reviewers using weighted and unweighted kappa statistics interpreted using Cohen’s criteria. Incidences were compared between scans using generalised estimating equations.

Scans of 101 infants were assessed. Inter-observer agreement was near perfect for the presence of white matter injury (k = 0.88 and 0.81 for the first and second scan respectively), and for the severity of white matter injury was near perfect at the first scan (k = 0.85) and substantial at the second scan (k = 0.80). The incidence of white matter injury detected by the two observers decreased between the first and second scans (30% to 22% and 29% to 19%), and severity also decreased.

This scoring system can be reliably applied in moderate-late preterm infants. White matter injury is common in moderate-late preterm infants but may be underestimated when MRI is performed close to term-equivalent age.

## Full-text entities

- **Diseases:** Punctate white matter injury (MESH:D056784)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12321648/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12321648/full.md

## References

1 references — full list in the complete paper: https://tomesphere.com/paper/PMC12321648/full.md

---
Source: https://tomesphere.com/paper/PMC12321648