PRIVEE: A Visual Analytic Workflow for Proactive Privacy Risk Inspection of Open Data
Kaustav Bhattacharjee, Akm Islam, Jaideep Vaidya, and Aritra Dasgupta

TL;DR
PRIVEE is a visual analytic tool designed to help data defenders proactively identify and interpret privacy risks in open datasets, especially those vulnerable to malicious joins, by providing interactive visualizations and risk assessments.
Contribution
The paper introduces PRIVEE, a novel visual analytic workflow that enables proactive privacy risk inspection in open data, developed through a design study with privacy researchers.
Findings
PRIVEE effectively identifies vulnerable data joins and privacy risks.
The tool helps simulate attack scenarios for better risk understanding.
Case studies demonstrate its utility in real-world privacy assessments.
Abstract
Open data sets that contain personal information are susceptible to adversarial attacks even when anonymized. By performing low-cost joins on multiple datasets with shared attributes, malicious users of open data portals might get access to information that violates individuals' privacy. However, open data sets are primarily published using a release-and-forget model, whereby data owners and custodians have little to no cognizance of these privacy risks. We address this critical gap by developing a visual analytic solution that enables data defenders to gain awareness about the disclosure risks in local, joinable data neighborhoods. The solution is derived through a design study with data privacy researchers, where we initially play the role of a red team and engage in an ethical data hacking exercise based on privacy attack scenarios. We use this problem and domain characterization to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Complex Network Analysis Techniques · Mental Health Research Topics
