pyCANON: A Python library to check the level of anonymity of a dataset
Judith S\'ainz-Pardo D\'iaz, \'Alvaro L\'opez Garc\'ia

TL;DR
pyCANON is a Python library that assesses the level of dataset anonymity using multiple privacy-preserving techniques, providing detailed reports to help ensure data privacy before sharing.
Contribution
This work introduces pyCANON, a comprehensive Python tool that evaluates dataset anonymity across various privacy models, facilitating privacy assessment with a user-friendly interface.
Findings
Supports multiple anonymity techniques including k-anonymity, l-diversity, t-closeness.
Provides detailed reports on privacy parameters for datasets.
Enables evaluation of multiple sensitive attributes with two distinct approaches.
Abstract
Openly sharing data with sensitive attributes and privacy restrictions is a challenging task. In this document we present the implementation of pyCANON, a Python library and command line interface (CLI) to check and assess the level of anonymity of a dataset through some of the most common anonymization techniques: k-anonymity, (,k)-anonymity, -diversity, entropy -diversity, recursive (c,)-diversity, basic -likeness, enhanced -likeness, t-closeness and -disclosure privacy. For the case of more than one sensitive attributes, two approaches are proposed for evaluating this techniques. The main strength of this library is to obtain a full report of the parameters that are fulfilled for each of the techniques mentioned above, with the unique requirement of the set of quasi-identifiers and that of sensitive attributes. We present the methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Internet Traffic Analysis and Secure E-voting
