# Structuring privacy policy: an AI approach

**Authors:** Shani Alkoby, Ron S. Hirschprung

PMC · DOI: 10.3389/frai.2025.1720547 · Frontiers in Artificial Intelligence · 2026-01-14

## TL;DR

This paper introduces an AI method to automatically structure privacy policy documents, making them easier to understand and use.

## Contribution

A novel two-layer AI methodology for structuring unstructured privacy policy texts into predefined parameters.

## Key findings

- The method achieved an average F1-score > 0.8 across 49 privacy policies.
- Five of six parameters showed very high classification accuracy in the empirical study.

## Abstract

Privacy has become a significant concern in the digital world, especially regarding the personal data collected by websites and other service providers on the World Wide Web. One of the main ways for individuals to control their privacy is the privacy policy document, which contains vital information on this matter. Publishing a privacy policy is required by regulation in most Western countries. However, the privacy policy is a free-text document, usually phrased in legal language and subject to frequent change, which makes it relatively hard to understand and almost always neglected by humans.

This research proposes a novel methodology that takes an unstructured privacy policy text and automatically structures it into predefined parameters. The methodology is based on a two-layer artificial intelligence (AI) process.

In an empirical study of 49 actual privacy policies from different websites, we demonstrated an average F1-score > 0.8, with five of the six parameters achieving very high classification accuracy.
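For readers less familiar with the metric, the following is a minimal sketch of how a per-parameter F1-score and its average can be computed. It assumes an unweighted (macro) average over parameters, which this summary does not confirm; the parameter names and counts are hypothetical placeholders, not data from the study.

```python
from dataclasses import dataclass


@dataclass
class Counts:
    """Confusion counts for one parameter (hypothetical values in the demo)."""
    tp: int  # true positives
    fp: int  # false positives
    fn: int  # false negatives


def f1_score(c: Counts) -> float:
    """F1 = 2*TP / (2*TP + FP + FN); returns 0.0 when the denominator is zero."""
    denom = 2 * c.tp + c.fp + c.fn
    return 2 * c.tp / denom if denom else 0.0


def macro_f1(per_parameter: dict[str, Counts]) -> float:
    """Unweighted mean of the per-parameter F1-scores (an assumed averaging scheme)."""
    scores = [f1_score(c) for c in per_parameter.values()]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Placeholder parameter names and counts, for illustration only.
    demo = {
        "param_a": Counts(tp=8, fp=2, fn=2),
        "param_b": Counts(tp=5, fp=0, fn=0),
    }
    for name, counts in demo.items():
        print(name, round(f1_score(counts), 3))
    print("macro F1:", round(macro_f1(demo), 3))
```

Under a macro average, one weak parameter lowers the mean without hiding the strong ones, which is consistent with reporting both an average and a per-parameter breakdown.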

This methodology can serve both humans and AI agents by addressing issues that deter the use of the privacy policy as a resource, such as cognitive burden, non-standard formalizations, cognitive laziness, and changes to the document over time. The study addresses a critical gap between present regulations, which aim to enhance privacy, and the ability of humans to benefit from the mandatorily published privacy policy.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12847394/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12847394/full.md

## References

92 references — full list in the complete paper: https://tomesphere.com/paper/PMC12847394/full.md

---
Source: https://tomesphere.com/paper/PMC12847394