PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance
Haoran Li, Wenbin Hu, Huihao Jing, Yulin Chen, Qi Hu, Sirui Han, Tianshu Chu, Peizhao Hu, Yangqiu Song

TL;DR
PrivaCI-Bench is a comprehensive benchmark for evaluating privacy in large language models based on Contextual Integrity theory, focusing on legal compliance and social context rather than just PII, revealing current models' limitations.
Contribution
This work introduces PrivaCI-Bench, a novel privacy evaluation benchmark that incorporates social context and legal regulations to assess LLMs' privacy and safety compliance.
Findings
LLMs can capture key CI parameters within a context
Current LLMs need improvements for full privacy compliance
Benchmark covers real court cases, policies, and synthetic data
Abstract
Recent advancements in generative large language models (LLMs) have enabled wider applicability, accessibility, and flexibility. However, their reliability and trustworthiness are still in doubt, especially for concerns regarding individuals' data privacy. Great efforts have been made on privacy by building various evaluation benchmarks to study LLMs' privacy awareness and robustness from their generated outputs to their hidden representations. Unfortunately, most of these works adopt a narrow formulation of privacy and only investigate personally identifiable information (PII). In this paper, we follow the merit of the Contextual Integrity (CI) theory, which posits that privacy evaluation should not only cover the transmitted attributes but also encompass the whole relevant social context through private information flows. We present PrivaCI-Bench, a comprehensive contextual privacy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy, Security, and Data Protection
MethodsADaptive gradient method with the OPTimal convergence rate
