TL;DR
Cookiescanner is an automated tool that detects GDPR consent notices on websites, evaluates their features, and helps assess compliance, addressing limitations of previous manual or limited automated methods.
Contribution
We introduce cookiescanner, an automated, scalable tool for detecting and analyzing GDPR consent notices, including a new annotated dataset for benchmarking.
Findings
Filter lists have high precision but lower recall than keyword methods.
BERT achieves high precision but low recall for English notices.
Color detection of buttons is effective for identifying decline options.
Abstract
The enforcement of the GDPR led to the widespread adoption of consent notices, colloquially known as cookie banners. Studies have shown that many website operators do not comply with the law and track users prior to any interaction with the consent notice, or attempt to trick users into giving consent through dark patterns. Previous research has relied on manually curated filter lists or automated detection methods limited to a subset of websites, making research on GDPR compliance of consent notices tedious or limited. We present \emph{cookiescanner}, an automated scanning tool that detects and extracts consent notices via various methods and checks if they offer a decline option or use color diversion. We evaluated cookiescanner on a random sample of the top 10,000 websites listed by Tranco. We found that manually curated filter lists have the highest precision but recall fewer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
