# Comprehensive Evaluation of Quality Indicators: Analyzing the Dutch Breast Cancer Audit

**Authors:** Elfi M. Verheul, Margrietha van der Linde, Hester F. Lingsma, Elvira Vos, Sabine Siesling, Linetta B. Koppert

PMC · DOI: 10.34172/ijhpm.8943 · International Journal of Health Policy and Management · 2025-10-01

## TL;DR

This study evaluates 18 quality indicators for breast cancer care in the Netherlands, finding that while most meet basic criteria, none are fully reliable for public reporting due to low rankability.

## Contribution

A data-driven evaluation framework for quality indicators in breast cancer care, applied to the Dutch audit.

## Key findings

- Feasibility of 15 out of 18 QIs was above 80.2%, meeting the threshold for practical use.
- Only 8 QIs showed good discriminative ability with an interquartile range above 10%.
- None of the QIs achieved high rankability (>75%), raising concerns about their reliability for public reporting.

## Abstract

Quality indicators (QIs) are widely used to benchmark hospital performance and improve quality of care but are often based on expert opinion rather than data-driven assessment. This study aims to evaluate QIs, using a framework that assesses four criteria: Feasibility, discriminative ability, validity, and reliability.

We used data from the Dutch breast cancer registry (NABON Breast Cancer Audit, NBCA) and included all surgically treated breast cancer patients in the Netherlands between 2021-2023. Eighteen QIs were evaluated. Feasibility was determined by QI numerator completeness, with >90% data availability considered feasible. Discriminative ability was assessed by between-hospital variation in QI scores, where an interquartile range (IQR) >10% indicated good discrimination. Validity was evaluated by the impact of case-mix adjustment and considered low when (pseudo-) R2 <0.10. The (pseudo-)R2 reflects the proportion of variance in QI scores explained by all case-mix variables in a regression model. Reliability was assessed by rankability, the proportion of between-hospital variation not due to chance and, therefore, explainable by quality of care. Rankability >75% was considered high.

After exclusion of one QI with feasibility <1%, feasibility ranged from 80.2%-100%, and 15 QIs were feasible. Overall, the IQR ranged from 1 to 36, with 8 QIs having an IQR higher than 10, indicating good discriminative ability. The (pseudo-)R2 ranged from 0.01-0.53, with 11 QIs showing low case-mix impact. Rankability ranged from 0-69%, with none of the QIs having a high rankability. None of the QIs met all preset criteria, but six QIs met at least three out of four criteria.

The QIs employed by the Dutch breast cancer registry fulfilled most criteria, but rankability is a concern and requires specific attention, especially for public reporting. Our results show the importance of considering feasibility, discriminative ability, validity, and reliability when evaluating QIs, and these should also be taken into account when developing new QIs.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Diseases:** Breast Cancer (MESH:D001943)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12958241/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/PMC12958241/full.md

---
Source: https://tomesphere.com/paper/PMC12958241