Text-Based Personas for Simulating User Privacy Decisions
Kassem Fawaz, Ren Yi, Octavian Suciu, Rishabh Khandelwal, Hamza Harkous, Nina Taft, Marco Gruteser

TL;DR
Narriva generates human-readable synthetic privacy personas grounded in prior user decisions, improving privacy decision simulation accuracy and efficiency over baseline methods.
Contribution
It introduces a novel persona generation method based on actual privacy decisions, enhancing simulation accuracy and reducing prompt tokens compared to existing approaches.
Findings
Achieves up to 87% predictive accuracy in modeling privacy preferences.
Reduces prompt tokens by 80-95% compared to in-context learning.
Synthesized personas from one survey replicate behaviors of different studies.
Abstract
The ability to simulate human privacy decisions has significant implications for aligning autonomous agents with individual intent and conducting cost-effective, large-scale privacy-centric user studies. Prior approaches prompt Large Language Models (LLMs) with natural language user statements, data-sharing histories, or demographic attributes to simulate privacy decisions. These approaches, however, fail to balance individual-level accuracy, human auditability, token efficiency, and population-level representation. We present Narriva, an approach that generates text-based synthetic privacy personas to address these shortcomings. Narriva grounds persona generation in prior user privacy decisions, such as those from large-scale survey datasets, rather than purely relying on demographic stereotypes. It compresses this data into concise, human-readable summaries structured by established…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
