An empirical assessment of differential privacy in real-world observational data: a case-control study of asthma exacerbation in UK Biobank linked with electronic health records
Mehrdad A Mizani, Aziz Sheikh, Amitava Banerjee

TL;DR
This study evaluates how differential privacy affects the results of a case-control analysis on asthma exacerbation using real-world health data.
Contribution
The paper empirically assesses differential privacy's impact on case-control study outcomes and provides insights for selecting privacy parameters.
Findings
Differential privacy altered odds ratios, sometimes mimicking misclassification and false-positive bias.
Rare covariates showed greater variability in odds ratios under differential privacy.
Epsilon values below ln(2) caused unstable results, suggesting the need for averaging or fixed random seeds.
Abstract
Electronic health records (EHRs) provide substantial resources for observational studies, yet present significant challenges in safeguarding patient privacy while maintaining research quality. Differential privacy (DP) offers a quantifiable privacy guarantee; however, its impact on observational studies remains underexplored. We empirically evaluated the effects of DP across varying values of its privacy parameter, epsilon, on case-control analysis outcomes using EHR data. This study aims to inform DP parameter selection and examines the influence of study characteristics on differentially private observational studies. We assessed the effects of DP on a case-control study of 1-year asthma exacerbations, including 22 165 participants with a history of asthma from UK Biobank linked to EHR data. Odds ratios (ORs) for sociodemographic factors and comorbidities were analyzed using adjusted…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Ethics in Clinical Research · Patient Dignity and Privacy
