PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance

Haoran Li; Wenbin Hu; Huihao Jing; Yulin Chen; Qi Hu; Sirui Han; Tianshu Chu; Peizhao Hu; Yangqiu Song

arXiv:2502.17041·cs.CL·May 26, 2025

PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance

Haoran Li, Wenbin Hu, Huihao Jing, Yulin Chen, Qi Hu, Sirui Han, Tianshu Chu, Peizhao Hu, Yangqiu Song

PDF

Open Access 1 Repo 1 Video

TL;DR

PrivaCI-Bench is a comprehensive benchmark for evaluating privacy in large language models based on Contextual Integrity theory, focusing on legal compliance and social context rather than just PII, revealing current models' limitations.

Contribution

This work introduces PrivaCI-Bench, a novel privacy evaluation benchmark that incorporates social context and legal regulations to assess LLMs' privacy and safety compliance.

Findings

01

LLMs can capture key CI parameters within a context

02

Current LLMs need improvements for full privacy compliance

03

Benchmark covers real court cases, policies, and synthetic data

Abstract

Recent advancements in generative large language models (LLMs) have enabled wider applicability, accessibility, and flexibility. However, their reliability and trustworthiness are still in doubt, especially for concerns regarding individuals' data privacy. Great efforts have been made on privacy by building various evaluation benchmarks to study LLMs' privacy awareness and robustness from their generated outputs to their hidden representations. Unfortunately, most of these works adopt a narrow formulation of privacy and only investigate personally identifiable information (PII). In this paper, we follow the merit of the Contextual Integrity (CI) theory, which posits that privacy evaluation should not only cover the transmitted attributes but also encompass the whole relevant social context through private information flows. We present PrivaCI-Bench, a comprehensive contextual privacy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HKUST-KnowComp/PrivaCI-Bench
pytorchOfficial

Videos

PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance· underline

Taxonomy

TopicsPrivacy, Security, and Data Protection

MethodsADaptive gradient method with the OPTimal convergence rate