LLM-as-a-Judge for Privacy Evaluation? Exploring the Alignment of Human and LLM Perceptions of Privacy in Textual Data
Stephen Meisenbacher, Alexandra Klymenko, and Florian Matthes

TL;DR
This paper investigates whether large language models can serve as reliable judges of privacy sensitivity in text, comparing their assessments with human perceptions across multiple datasets and analyzing their potential and limitations.
Contribution
It introduces the use of LLMs as privacy evaluators, demonstrating their ability to model human privacy perspectives despite the subjective nature of privacy.
Findings
LLMs can accurately model a global human privacy perspective
Privacy perception shows low inter-human agreement, indicating its complexity
LLMs exhibit both merits and limitations in privacy evaluation
Abstract
Despite advances in the field of privacy-preserving Natural Language Processing (NLP), a significant challenge remains the accurate evaluation of privacy. As a potential solution, using LLMs as a privacy evaluator presents a promising approach a strategy inspired by its success in other subfields of NLP. In particular, the so-called paradigm has achieved impressive results on a variety of natural language evaluation tasks, demonstrating high agreement rates with human annotators. Recognizing that privacy is both subjective and difficult to define, we investigate whether LLM-as-a-Judge can also be leveraged to evaluate the privacy sensitivity of textual data. Furthermore, we measure how closely LLM evaluations align with human perceptions of privacy in text. Resulting from a study involving 10 datasets, 13 LLMs, and 677 human survey…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
