Importance of diagnostic accuracy in big data: False-positive diagnoses of type 2 diabetes in health insurance claims data of 70 million Germans
Ralph Brinks, Thaddaeus Toennies, Annika Hoyer

TL;DR
This study estimates the false-positive diagnosis rate of type 2 diabetes in Germany's health insurance claims data of 70 million people, highlighting the importance of accounting for diagnostic errors in big data research.
Contribution
It introduces a mathematical method to estimate false-positive diagnosis rates in large claims data sets, specifically applied to type 2 diabetes in Germany.
Findings
False-positive rate varies with age and sex.
Approximately 217,000 false-positive cases identified.
Higher false-positive rates in women across all age groups.
Abstract
Large data sets comprising diagnoses about chronic conditions are becoming increasingly available for research purposes. In Germany, it is planned that aggregated claims data including medical diagnoses from the statutory health insurance with roughly 70 million insurants will be published on a regular basis. Validity of the diagnoses in such big data sets can hardly be assessed. In case the data set comprises prevalence, incidence and mortality, it is possible to estimate the proportion of false positive diagnoses using mathematical relations from the illness-death model. We apply the method to age-specific aggregated claims data from 70 million Germans about type 2 diabetes in Germany stratified by sex and report the findings in terms of the ratio of false positive diagnoses of type 2 diabetes (FPR) in the data set. The age-specific FPR for men and women changes with age. In men, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealth Promotion and Cardiovascular Prevention · Chronic Disease Management Strategies · Diabetes Management and Education
