Auditing M-LLMs for Privacy Risks: A Synthetic Benchmark and Evaluation Framework
Junhao Li, Jiahao Chen, Zhou Feng, Chunyi Zhou

TL;DR
This paper introduces PRISM, a synthetic multi-modal benchmark and evaluation framework to assess privacy risks in M-LLMs, revealing their significant potential to infer sensitive personal attributes from social media-like data.
Contribution
The paper presents PRISM, the first comprehensive multi-modal privacy benchmark, and an evaluation framework to measure M-LLMs' capabilities in inferring private information, highlighting privacy vulnerabilities.
Findings
M-LLMs outperform humans in privacy inference accuracy
Six leading M-LLMs show significant privacy inference capabilities
PRISM enables targeted privacy risk analysis across diverse profiles
Abstract
Recent advances in multi-modal Large Language Models (M-LLMs) have demonstrated a powerful ability to synthesize implicit information from disparate sources, including images and text. These resourceful data from social media also introduce a significant and underexplored privacy risk: the inference of sensitive personal attributes from seemingly daily media content. However, the lack of benchmarks and comprehensive evaluations of state-of-the-art M-LLM capabilities hinders the research of private attribute profiling on social media. Accordingly, we propose (1) PRISM, the first multi-modal, multi-dimensional and fine-grained synthesized dataset incorporating a comprehensive privacy landscape and dynamic user history; (2) an Efficient evaluation framework that measures the cross-modal privacy inference capabilities of advanced M-LLM. Specifically, PRISM is a large-scale synthetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Authorship Attribution and Profiling · Privacy, Security, and Data Protection
