Operationalizing Contextual Integrity in Privacy-Conscious Assistants

Sahra Ghalebikesabi; Eugene Bagdasaryan; Ren Yi; Itay Yona; Ilia; Shumailov; Aneesh Pappu; Chongyang Shi; Laura Weidinger; Robert Stanforth,; Leonard Berrada; Pushmeet Kohli; Po-Sen Huang; Borja Balle

arXiv:2408.02373·cs.AI·September 16, 2024·2 cites

Operationalizing Contextual Integrity in Privacy-Conscious Assistants

Sahra Ghalebikesabi, Eugene Bagdasaryan, Ren Yi, Itay Yona, Ilia, Shumailov, Aneesh Pappu, Chongyang Shi, Laura Weidinger, Robert Stanforth,, Leonard Berrada, Pushmeet Kohli, Po-Sen Huang, Borja Balle

PDF

Open Access

TL;DR

This paper introduces a method to align AI assistants' information sharing with privacy norms using the framework of contextual integrity, evaluated through a novel webform benchmark and prompting strategies.

Contribution

It operationalizes the concept of contextual integrity to steer AI assistants' information sharing, enhancing privacy compliance in complex tasks.

Findings

01

Prompting LLMs with CI-based reasoning improves privacy compliance.

02

The novel webform benchmark effectively evaluates privacy-aware assistant behavior.

03

Strategies based on CI can significantly reduce inappropriate information sharing.

Abstract

Advanced AI assistants combine frontier LLMs and tool access to autonomously perform complex tasks on behalf of users. While the helpfulness of such assistants can increase dramatically with access to user information including emails and documents, this raises privacy concerns about assistants sharing inappropriate information with third parties without user supervision. To steer information-sharing assistants to behave in accordance with privacy expectations, we propose to operationalize contextual integrity (CI), a framework that equates privacy with the appropriate flow of information in a given context. In particular, we design and evaluate a number of strategies to steer assistants' information-sharing actions to be CI compliant. Our evaluation is based on a novel form filling benchmark composed of human annotations of common webform applications, and it reveals that prompting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy, Security, and Data Protection · Privacy-Preserving Technologies in Data · Cognitive Functions and Memory