LLM-as-a-Judge for Privacy Evaluation? Exploring the Alignment of Human and LLM Perceptions of Privacy in Textual Data

Stephen Meisenbacher; Alexandra Klymenko; and Florian Matthes

arXiv:2508.12158·cs.CL·August 19, 2025

LLM-as-a-Judge for Privacy Evaluation? Exploring the Alignment of Human and LLM Perceptions of Privacy in Textual Data

Stephen Meisenbacher, Alexandra Klymenko, and Florian Matthes

PDF

TL;DR

This paper investigates whether large language models can serve as reliable judges of privacy sensitivity in text, comparing their assessments with human perceptions across multiple datasets and analyzing their potential and limitations.

Contribution

It introduces the use of LLMs as privacy evaluators, demonstrating their ability to model human privacy perspectives despite the subjective nature of privacy.

Findings

01

LLMs can accurately model a global human privacy perspective

02

Privacy perception shows low inter-human agreement, indicating its complexity

03

LLMs exhibit both merits and limitations in privacy evaluation

Abstract

Despite advances in the field of privacy-preserving Natural Language Processing (NLP), a significant challenge remains the accurate evaluation of privacy. As a potential solution, using LLMs as a privacy evaluator presents a promising approach $\unicode x 2013$ a strategy inspired by its success in other subfields of NLP. In particular, the so-called $LLM-as-a-Judge$ paradigm has achieved impressive results on a variety of natural language evaluation tasks, demonstrating high agreement rates with human annotators. Recognizing that privacy is both subjective and difficult to define, we investigate whether LLM-as-a-Judge can also be leveraged to evaluate the privacy sensitivity of textual data. Furthermore, we measure how closely LLM evaluations align with human perceptions of privacy in text. Resulting from a study involving 10 datasets, 13 LLMs, and 677 human survey…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.