Natural Language Processing of Privacy Policies: A Survey

Andrick Adhikari; Sanchari Das; Rinku Dewri

arXiv:2501.10319·cs.CL·January 20, 2025

Natural Language Processing of Privacy Policies: A Survey

Andrick Adhikari, Sanchari Das, Rinku Dewri

PDF

Open Access

TL;DR

This survey reviews NLP techniques applied to privacy policies, highlighting current methods, gaps, and future research opportunities to improve privacy notice communication and understanding.

Contribution

It provides a comprehensive analysis of 109 papers on NLP for privacy policies, identifying research gaps and suggesting future directions for robust privacy policy NLP applications.

Findings

01

Many studies focus on annotating and classifying privacy texts

02

Significant research gaps exist in summarization and contextualized embeddings

03

Opportunities for domain-specific model tuning are identified

Abstract

Natural Language Processing (NLP) is an essential subset of artificial intelligence. It has become effective in several domains, such as healthcare, finance, and media, to identify perceptions, opinions, and misuse, among others. Privacy is no exception, and initiatives have been taken to address the challenges of usable privacy notifications to users with the help of NLP. To this aid, we conduct a literature review by analyzing 109 papers at the intersection of NLP and privacy policies. First, we provide a brief introduction to privacy policies and discuss various facets of associated problems, which necessitate the application of NLP to elevate the current state of privacy notices and disclosures to users. Subsequently, we a) provide an overview of the implementation and effectiveness of NLP approaches for better privacy policy communication; b) identify the methodologies that can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy, Security, and Data Protection · Cybercrime and Law Enforcement Studies

MethodsFocus