To share or not to share: What risks would laypeople accept to give   sensitive data to differentially-private NLP systems?

Christopher Weiss; Frauke Kreuter; Ivan Habernal

arXiv:2307.06708·cs.CL·March 26, 2024·1 cites

To share or not to share: What risks would laypeople accept to give sensitive data to differentially-private NLP systems?

Christopher Weiss, Frauke Kreuter, Ivan Habernal

PDF

Open Access

TL;DR

This study investigates how laypeople perceive privacy risks in NLP systems using differential privacy, revealing the privacy thresholds that influence their willingness to share sensitive data through behavioral experiments.

Contribution

It introduces the first behavioral experiment assessing laypeople's privacy thresholds for sharing sensitive data under differential privacy in NLP.

Findings

01

Laypeople's willingness to share sensitive data varies with privacy budget levels.

02

Risk perception significantly influences data sharing decisions.

03

The study provides insights into setting privacy parameters aligned with user comfort.

Abstract

Although the NLP community has adopted central differential privacy as a go-to framework for privacy-preserving model training or data sharing, the choice and interpretation of the key parameter, privacy budget $ε$ that governs the strength of privacy protection, remains largely arbitrary. We argue that determining the $ε$ value should not be solely in the hands of researchers or system developers, but must also take into account the actual people who share their potentially sensitive data. In other words: Would you share your instant messages for $ε$ of 10? We address this research gap by designing, implementing, and conducting a behavioral experiment (311 lay participants) to study the behavior of people in uncertain decision-making situations with respect to privacy-threatening situations. Framing the risk perception in terms of two realistic NLP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy, Security, and Data Protection · Ethics and Social Impacts of AI