NLP Privacy Risk Identification in Social Media (NLP-PRISM): A Survey

Dhiman Goswami; Jai Kruthunz Naveen Kumar; Sanchari Das

arXiv:2602.15866·cs.CL·February 19, 2026

NLP Privacy Risk Identification in Social Media (NLP-PRISM): A Survey

Dhiman Goswami, Jai Kruthunz Naveen Kumar, Sanchari Das

PDF

Open Access 1 Video

TL;DR

This survey reviews privacy risks in social media NLP applications, introduces the NLP-PRISM framework for risk assessment, and highlights significant gaps and trade-offs in current privacy-preserving NLP methods.

Contribution

The paper presents NLP-PRISM, a comprehensive framework for identifying privacy vulnerabilities in social media NLP, and provides an extensive analysis of privacy risks across multiple NLP tasks and models.

Findings

01

Transformer models have F1-scores of 0.58-0.84 but drop 1%-23% with privacy tuning.

02

Significant gaps exist in privacy coverage across six NLP tasks.

03

Privacy attacks show a 2%-9% utility reduction and high attack success rates.

Abstract

Natural Language Processing (NLP) is integral to social media analytics but often processes content containing Personally Identifiable Information (PII), behavioral cues, and metadata raising privacy risks such as surveillance, profiling, and targeted advertising. To systematically assess these risks, we review 203 peer-reviewed papers and propose the NLP Privacy Risk Identification in Social Media (NLP-PRISM) framework, which evaluates vulnerabilities across six dimensions: data collection, preprocessing, visibility, fairness, computational risk, and regulatory compliance. Our analysis shows that transformer models achieve F1-scores ranging from 0.58-0.84, but incur a 1% - 23% drop under privacy-preserving fine-tuning. Using NLP-PRISM, we examine privacy coverage in six NLP tasks: sentiment analysis (16), emotion detection (14), offensive language identification (19), code-mixed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

NLP Privacy Risk Identification in Social Media (NLP-PRISM): A Survey· underline

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts · Authorship Attribution and Profiling