To share or not to share: What risks would laypeople accept to give sensitive data to differentially-private NLP systems?
Christopher Weiss, Frauke Kreuter, Ivan Habernal

TL;DR
This study investigates how laypeople perceive privacy risks in NLP systems using differential privacy, revealing the privacy thresholds that influence their willingness to share sensitive data through behavioral experiments.
Contribution
It introduces the first behavioral experiment assessing laypeople's privacy thresholds for sharing sensitive data under differential privacy in NLP.
Findings
Laypeople's willingness to share sensitive data varies with privacy budget levels.
Risk perception significantly influences data sharing decisions.
The study provides insights into setting privacy parameters aligned with user comfort.
Abstract
Although the NLP community has adopted central differential privacy as a go-to framework for privacy-preserving model training or data sharing, the choice and interpretation of the key parameter, privacy budget that governs the strength of privacy protection, remains largely arbitrary. We argue that determining the value should not be solely in the hands of researchers or system developers, but must also take into account the actual people who share their potentially sensitive data. In other words: Would you share your instant messages for of 10? We address this research gap by designing, implementing, and conducting a behavioral experiment (311 lay participants) to study the behavior of people in uncertain decision-making situations with respect to privacy-threatening situations. Framing the risk perception in terms of two realistic NLP…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy, Security, and Data Protection · Ethics and Social Impacts of AI
