Patient-Driven Privacy Control through Generalized Distillation
Z. Berkay Celik, David Lopez-Paz, Patrick McDaniel

TL;DR
This paper introduces privacy distillation, a method enabling patients to control their data sharing in medical models, maintaining high accuracy while protecting sensitive information, demonstrated through personalized warfarin dosage predictions.
Contribution
It presents a novel privacy distillation technique that balances patient privacy with model accuracy in healthcare data analysis.
Findings
Privacy distillation retains 97% of full-data model accuracy.
It reduces errors in dosage predictions by 3.9%.
Effective in preventing health-related risks due to data withholding.
Abstract
The introduction of data analytics into medicine has changed the nature of patient treatment. In this, patients are asked to disclose personal information such as genetic markers, lifestyle habits, and clinical history. This data is then used by statistical models to predict personalized treatments. However, due to privacy concerns, patients often desire to withhold sensitive information. This self-censorship can impede proper diagnosis and treatment, which may lead to serious health complications and even death over time. In this paper, we present privacy distillation, a mechanism which allows patients to control the type and amount of information they wish to disclose to the healthcare providers for use in statistical models. Meanwhile, it retains the accuracy of models that have access to all patient data under a sufficient but not full set of privacy-relevant information. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
