Automated Privacy Information Annotation in Large Language Model Interactions

Hang Zeng; Xiangyu Liu; Yong Hu; Chaoyue Niu; Fan Wu; Shaojie Tang; Guihai Chen

arXiv:2505.20910·cs.CL·August 11, 2025

Automated Privacy Information Annotation in Large Language Model Interactions

Hang Zeng, Xiangyu Liu, Yong Hu, Chaoyue Niu, Fan Wu, Shaojie Tang, Guihai Chen

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces a large-scale multilingual dataset and automated annotation pipeline for detecting private information leaks in user interactions with large language models, aiming to improve privacy protection.

Contribution

It presents a novel dataset and automated annotation method for privacy leak detection in LLM interactions, tailored for deployment on local devices.

Findings

01

Baseline models show performance gaps in real-world privacy detection.

02

The dataset enables comprehensive evaluation of privacy leakage detection methods.

03

Evaluation metrics cover privacy leakage, phrase extraction, and information accuracy.

Abstract

Users interacting with large language models (LLMs) under their real identifiers often unknowingly risk disclosing private information. Automatically notifying users whether their queries leak privacy and which phrases leak what private information has therefore become a practical need. Existing privacy detection methods, however, were designed for different objectives and application domains, typically tagging personally identifiable information (PII) in anonymous content, which is insufficient in real-name interaction scenarios with LLMs. In this work, to support the development and evaluation of privacy detection models for LLM interactions that are deployable on local user devices, we construct a large-scale multilingual dataset with 249K user queries and 154K annotated privacy phrases. In particular, we build an automated privacy annotation pipeline with strong LLMs to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nidryen-zh/privacyannotation
pytorchOfficial

Datasets

Nidhogg-zh/Interaction_Dialogue_with_Privacy
dataset· 271 dl
271 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection