# AI-based image quality assessment of positioning in mammography: considerations and challenges

**Authors:** Tina Santner, Mickael Tardy, Johanne-Gro Stalheim, Stephanie Frei, Wolfram Santner, Stefano Gianolini, Malik Galijasevic, Marthe Larsen, Jonas Gjesvik, Solveig Hofvind, Gerlig Widmann

PMC · DOI: 10.1186/s13244-025-02191-3 · Insights into Imaging · 2026-02-16

## TL;DR

This paper explores how well an AI algorithm can replicate human assessments of mammogram image quality, finding that agreement varies significantly across different categories.

## Contribution

The study evaluates an AI prototype's ability to replicate the PGMI system for mammogram quality assessment, highlighting challenges in automated classification.

## Key findings

- Slight agreement between AI and human experts for CC views (κ = 0.14) and fair for MLO views (κ = 0.25).
- Highest agreement in CC category 'M. Pectoralis visibility' (κ = 0.75) and MLO 'Pectoralis angle' (κ = 0.49).
- Disagreements revealed misinterpretations of anatomical landmarks and categorization causality issues.

## Abstract

Artificial intelligence (AI) could facilitate and objectify quality assessment in the daily routine. The purpose was to explore the extent to which an AI prototype algorithm is able to replicate the perfect-good-moderate-inadequate (PGMI) system (perfect, good, moderate, inadequate).

From a multicentre case collection, 200 standard mammograms (800 images) were selected. A deep learning-based prototype software was used to rate the images in analogy to the PGMI system. The AI results were compared with a reference standard obtained through consensus reading by three expert radiographers and one expert radiologist, using quadratically weighted Cohen’s kappa with confidence intervals (CI) and context-based interpretation. Frequency and reasons for disagreement were evaluated for challenging cases with a discrepancy of two or more grades and a discrepancy in assigning an inadequate.

For overall PGMI per image, slight agreement between human consensus and AI was observed for CC views (κ = 0.14) and fair agreement for MLO views (κ = 0.25). The highest agreement was observed for the CC category “M. Pectoralis visibility” (substantial, κ = 0.75). Best category in MLO was “Pectoralis angle” (moderate, κ = 0.49). For other categories, fair, slight or poor agreement was observed. The work-up of disagreement gave insight into misinterpretations of anatomical landmarks and causality issues in the categorization.

Transforming the PGMI system into a fully automated AI algorithm is challenging and may differ substantially between subcategories. Further research in computer science and quality assessment methodology is needed to pave the way for AI-based objective quality management in mammography.

Profound evaluation of AI algorithms and their ability to replicate human interpretation, scoring, and classification are the basis and scientific framework toward AI-based objective quality management in mammography.

AI has huge potential for automated assessment of diagnostic image quality.Compared with human reading agreement, substantial disagreement may also be found.Direct transformation of perfect-good-moderate-inadequate scoring into an AI algorithm is challenging.

AI has huge potential for automated assessment of diagnostic image quality.

Compared with human reading agreement, substantial disagreement may also be found.

Direct transformation of perfect-good-moderate-inadequate scoring into an AI algorithm is challenging.

## Full-text entities

- **Diseases:** cancer (MESH:D009369), MLO (MESH:C537736), AI (MESH:C538142), PNL (MESH:C000626393), PGMI (MESH:D012892), M. Pectoralis (MESH:C566367), Breast cancer (MESH:D001943)
- **Chemicals:** MLO (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12909628/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12909628/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12909628/full.md

---
Source: https://tomesphere.com/paper/PMC12909628