# Inter- and intra-observer agreement in ultrasound diagnosis of steatotic liver disease: implications for screening in resource-limited settings

**Authors:** Maria Spencer-Sandino, Maya Balakrishnan, David Wynne, Ilona Argirion, Paz Cook, Vanessa Van De Wyngard, Noldy Mardones, Ruth Pfeiffer, Allan Hildesheim, Catterina Ferreccio, Jill Koshiol

PMC · DOI: 10.1038/s41598-025-07862-1 · 2025-08-14

## TL;DR

This study shows that ultrasound diagnosis of liver disease has inconsistent results between and within observers, suggesting the need for better training and quality control in high-risk populations.

## Contribution

The study quantifies inter- and intra-observer variability in ultrasound-based SLD diagnosis in a high-risk cohort.

## Key findings

- Inter-observer agreement between a radiologist and technicians was slight to fair.
- Intra-observer agreement was moderate to substantial for some observers.
- The results highlight the need for quality control in ultrasound-based SLD screening.

## Abstract

Steatotic liver disease (SLD), which is associated with increased risk of cancer-related mortality, needs timely and cost-effective detection. Although liver biopsy remains the diagnostic gold standard, its invasiveness and high-cost limit widespread use. Ultrasound is a practical and affordable alternative. We evaluated inter- and intra-observer agreement for ultrasound-based diagnosis of SLD using images from the Chile Biliary Longitudinal Study (Chile BiLS), a cohort of women with gallstones. These women have a high burden of obesity and related metabolic disorders, putting them at higher risk for SLD. A radiologist (observer 1) reviewed a randomly selected subset of 425 baseline images and compared them with the original readings from Chile BiLS radiology technicians. To assess intra-observer reproducibility, observer 1 reanalyzed 34 blinded duplicates, and two Chile BiLS radiology technicians (observers 2 and 3) independently reviewed these images. Observer 2 then re-reviewed the 34 images to assess intra-observer agreement. Agreement was analyzed using kappa and percent agreement. Observer 1 had slight inter-observer agreement (kappa: 0.12; 95% CI 0.08–0.15, p < 0.001; percent agreement: 41.0%), while observers 2 and 3 showed fair agreement (kappa: 0.29: 95% CI 0.11–0.58, p < 0.05; percent agreement: 64.7% and kappa: 0.32: 95% CI 0.06–0.58, p < 0.05; percent agreement: 63.6%, respectively). Intra-observer agreement was moderate for observer 1 (kappa: 0.45; 95% CI 0.08–0.82, p < 0.05; percent agreement: 81.3%), and substantial for observer 2 (kappa: 0.64; 95% CI 0.37–0.90, p < 0.001; percent agreement: 81.8%). Our findings highlight variability in ultrasound interpretation, underscoring the necessity of inter- and intra-observer comparisons for optimal diagnosis and quality control to enhance diagnostic consistency in high-risk populations.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Diseases:** metabolic syndrome (MESH:D024821), renal disease (MESH:D007674), overweight (MESH:D050177), diabetes (MESH:D003920), Cancer (MESH:D009369), cirrhosis (MESH:D005355), obese (MESH:D009765), MASLD (MESH:D008107), gallstone disease (MESH:D002769), Health Disparities (MESH:D011019), gallbladder disease and cancer (MESH:D005706), hypertension (MESH:D006973), hepatocellular carcinoma (MESH:D006528), ALD (MESH:D008108), gallstones (MESH:D042882), abdominal adiposity (MESH:D000007), metabolic disorders (MESH:D008659), NAFLD (MESH:D065626), Liver steatosis (MESH:D005234), type 2 diabetes (MESH:D003924)
- **Chemicals:** triglyceride (MESH:D014280)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12354845/full.md

---
Source: https://tomesphere.com/paper/PMC12354845