Compliance Checking with NLI: Privacy Policies vs. Regulations

Amin Rabinia; Zane Nygaard

arXiv:2204.01845 · cs.CL · April 6, 2022

TL;DR

This paper uses Natural Language Inference (NLI) models to automatically check privacy policies against privacy regulations. It compares a model trained on SNLI with one trained on MNLI and finds that the MNLI-trained model performs better on real-world policies.

Contribution

The paper introduces an NLI-based approach to automated compliance checking of privacy policies and evaluates how models trained on two different datasets (SNLI and MNLI) perform.

Findings

01. MNLI-trained model generalizes better to real-world policies.

02. SNLI-trained model has higher accuracy on test data.

03. NLI techniques can assist in legal compliance verification.

Abstract

A privacy policy is a document that states how a company intends to handle and manage its customers' personal data. One problem with these policies is that their content might violate data privacy regulations. Because of the enormous number of privacy policies that exist, the only realistic way to check all of them for legal inconsistencies is through an automated method. In this work, we use Natural Language Inference (NLI) techniques to compare privacy regulations against sections of privacy policies from a selection of large companies. Our NLI model uses pre-trained embeddings, along with a BiLSTM in its attention mechanism. We tried two versions of our model: one trained on the Stanford Natural Language Inference (SNLI) dataset and the other on the Multi-Genre Natural Language Inference (MNLI) dataset. We found that our test accuracy was higher on…
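
To make the comparison step in the abstract concrete, the following Python sketch pairs each regulation clause with each policy section and records a three-way NLI label (entailment, neutral, contradiction). It is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the choice of the regulation as premise and the policy section as hypothesis, the idea of flagging contradictions for review, and the keyword-based `toy_classifier` are all hypothetical. In practice the classifier would be a trained NLI model such as the BiLSTM-based one the abstract mentions.

```python
from dataclasses import dataclass
from itertools import product
from typing import Callable, List, Sequence

# The three standard NLI labels used by SNLI and MNLI.
LABELS = ("entailment", "neutral", "contradiction")


@dataclass(frozen=True)
class PairResult:
    regulation: str       # treated as the NLI premise (assumption)
    policy_section: str   # treated as the NLI hypothesis (assumption)
    label: str


def check_compliance(
    regulations: Sequence[str],
    policy_sections: Sequence[str],
    classify: Callable[[str, str], str],
) -> List[PairResult]:
    """Label every (regulation, policy section) pair with an NLI class."""
    results = []
    for reg, sec in product(regulations, policy_sections):
        label = classify(reg, sec)
        if label not in LABELS:
            raise ValueError(f"unexpected NLI label: {label!r}")
        results.append(PairResult(reg, sec, label))
    return results


def flag_contradictions(results: Sequence[PairResult]) -> List[PairResult]:
    """Return pairs labeled 'contradiction' as candidates for human review."""
    return [r for r in results if r.label == "contradiction"]


def toy_classifier(premise: str, hypothesis: str) -> str:
    """Hypothetical keyword stand-in for a trained NLI model."""
    p, h = premise.lower(), hypothesis.lower()
    if "consent" in p and "without consent" in h:
        return "contradiction"
    if "consent" in p and "consent" in h:
        return "entailment"
    return "neutral"


# Made-up example texts, for illustration only.
EXAMPLE_REGULATIONS = [
    "Personal data may be processed only with the user's consent.",
]
EXAMPLE_SECTIONS = [
    "We share your data with partners without consent.",
    "We ask for your consent before collecting data.",
    "We use cookies to remember your language.",
]

for result in check_compliance(EXAMPLE_REGULATIONS, EXAMPLE_SECTIONS, toy_classifier):
    print(result.label, "|", result.policy_section)
```

Passing the classifier in as a function keeps the pairing logic independent of the model, so an SNLI-trained and an MNLI-trained model could be swapped in and compared on the same policy sections.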

Taxonomy

Topics: Hate Speech and Cyberbullying Detection · Computational and Text Analysis Methods · Artificial Intelligence in Law

Methods: Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Bidirectional LSTM