Auditing M-LLMs for Privacy Risks: A Synthetic Benchmark and Evaluation Framework

Junhao Li; Jiahao Chen; Zhou Feng; Chunyi Zhou

arXiv:2511.03248·cs.CR·November 11, 2025

Auditing M-LLMs for Privacy Risks: A Synthetic Benchmark and Evaluation Framework

Junhao Li, Jiahao Chen, Zhou Feng, Chunyi Zhou

PDF

Open Access 1 Datasets

TL;DR

This paper introduces PRISM, a synthetic multi-modal benchmark and evaluation framework to assess privacy risks in M-LLMs, revealing their significant potential to infer sensitive personal attributes from social media-like data.

Contribution

The paper presents PRISM, the first comprehensive multi-modal privacy benchmark, and an evaluation framework to measure M-LLMs' capabilities in inferring private information, highlighting privacy vulnerabilities.

Findings

01

M-LLMs outperform humans in privacy inference accuracy

02

Six leading M-LLMs show significant privacy inference capabilities

03

PRISM enables targeted privacy risk analysis across diverse profiles

Abstract

Recent advances in multi-modal Large Language Models (M-LLMs) have demonstrated a powerful ability to synthesize implicit information from disparate sources, including images and text. These resourceful data from social media also introduce a significant and underexplored privacy risk: the inference of sensitive personal attributes from seemingly daily media content. However, the lack of benchmarks and comprehensive evaluations of state-of-the-art M-LLM capabilities hinders the research of private attribute profiling on social media. Accordingly, we propose (1) PRISM, the first multi-modal, multi-dimensional and fine-grained synthesized dataset incorporating a comprehensive privacy landscape and dynamic user history; (2) an Efficient evaluation framework that measures the cross-modal privacy inference capabilities of advanced M-LLM. Specifically, PRISM is a large-scale synthetic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

xaddh/multimodal-privacy
dataset· 25 dl
25 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Authorship Attribution and Profiling · Privacy, Security, and Data Protection