# Why NHS hospital co-morbidity research may be wrong: how clinical coding fails to identify the impact of diabetes mellitus on cancer survival

**Authors:** K. Zucker, C. McInerney, A. Glaser, P. Baxter, G. Hall

PMC · DOI: 10.1038/s41416-025-03136-9 · British Journal of Cancer · 2025-08-09

## TL;DR

This study shows that hospital coding often misses diabetes diagnoses in cancer patients, leading to incorrect survival estimates.

## Contribution

The paper demonstrates how clinical coding misclassifies diabetes, affecting cancer survival analysis and research validity.

## Key findings

- Clinical coding missed 14.6% of diabetic cancer patients.
- Relying on coding overestimates diabetes' negative impact on cancer survival.
- The temporal misclassification rate was 17.5%, which affected analytic outcomes.

## Abstract

Significant volumes of research rely on secondary care diagnostic coding to identify comorbidities; however, little is known about its accuracy at a population level or whether this influences subsequent analysis.

This retrospective observational study used real-world data on patients diagnosed at Leeds Cancer Centre between 2005 and 2018, in three cohorts: all cancers, prostate cancer and breast cancer. Three data definitions were used to identify patients with diabetes in each cohort: (1) clinical coding alone, (2) HbA1c blood test alone, and (3) either clinical coding or abnormal HbA1c. Cohort characteristics, diagnosis dates and Cox-derived survival were compared across diabetes definitions.
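The three diabetes definitions can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' pipeline: the `Patient` fields, the function names, and the toy cohort are hypothetical, and the 48 mmol/mol cut-off is an assumption (a commonly used diagnostic threshold), since the summary says only "abnormal HbA1c". It also assumes the coding miss rate is measured against the combined definition (3).

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumption: 48 mmol/mol is used here as the "abnormal" HbA1c cut-off.
# The summary does not state which threshold the study applied.
HBA1C_THRESHOLD_MMOL_MOL = 48.0


@dataclass
class Patient:
    patient_id: str
    has_diabetes_code: bool      # hypothetical flag: any diabetes code in hospital records
    max_hba1c: Optional[float]   # highest recorded HbA1c (mmol/mol), None if never tested


def by_coding(p: Patient) -> bool:
    """Definition 1: clinical coding alone."""
    return p.has_diabetes_code


def by_hba1c(p: Patient) -> bool:
    """Definition 2: HbA1c blood test alone."""
    return p.max_hba1c is not None and p.max_hba1c >= HBA1C_THRESHOLD_MMOL_MOL


def by_either(p: Patient) -> bool:
    """Definition 3: either clinical coding or abnormal HbA1c."""
    return by_coding(p) or by_hba1c(p)


def missed_by_coding(patients: List[Patient]) -> float:
    """Share of patients meeting definition 3 who carry no diabetes code."""
    diabetic = [p for p in patients if by_either(p)]
    if not diabetic:
        return 0.0
    missed = [p for p in diabetic if not by_coding(p)]
    return len(missed) / len(diabetic)


# Toy cohort with made-up values, purely for illustration.
cohort = [
    Patient("a", True, 60.0),    # coded and abnormal HbA1c
    Patient("b", False, 52.0),   # abnormal HbA1c only: missed by coding
    Patient("c", True, None),    # coded, never tested
    Patient("d", False, 40.0),   # normal HbA1c, not coded
    Patient("e", False, None),   # no evidence of diabetes
]

miss_rate = missed_by_coding(cohort)
```

In this toy cohort three patients meet definition 3 and one of them has no diabetes code, so `miss_rate` is one third. The same comparison on real data would need the study's actual code lists and HbA1c criteria.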

A total of 123,841 cancer patients were identified, including 13,964 with diabetes. Clinical coding failed to identify 14.6% of diabetic cancer patients, with a temporal misclassification rate of 17.5%. Sole reliance on clinical coding overestimated the negative effect of diabetes mellitus (DM) on median survival across all cancers, and by 3.17 years in breast cancer.

Clinical coding provides inaccurate diabetes detection and diagnosis dates, resulting in meaningful differences in analytic outcomes. This supports the use of more detailed comorbidity data definitions. The results cast doubt on research reliant on hospital clinical coding alone and on the generalisability of some comorbidity and frailty scoring systems.

## Linked entities

- **Diseases:** diabetes mellitus (MONDO:0005015), cancer (MONDO:0004992), breast cancer (MONDO:0004989), prostate cancer (MONDO:0005159)

## Full-text entities

- **Diseases:** diabetes (MESH:D003920), breast cancer (MESH:D001943), prostate cancer (MESH:D011471), Cancer (MESH:D009369), DM (MESH:D009223)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12532788/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12532788/full.md

## References

7 references — full list in the complete paper: https://tomesphere.com/paper/PMC12532788/full.md

---
Source: https://tomesphere.com/paper/PMC12532788