Structure Selection for Fairness-Constrained Differentially Private Data Synthesis
Naeim Ghahramanpour, Mostafa Milani

TL;DR
This paper introduces PrivCI, a method for privacy-preserving data synthesis that enforces fairness constraints by integrating conditional independence into the measurement process, leading to more accurate and fair synthetic data.
Contribution
PrivCI is the first approach to incorporate conditional independence constraints directly into the measurement phase of differentially private data synthesis using a CI-aware greedy MST algorithm.
Findings
PrivCI outperforms existing methods in fidelity and accuracy.
It effectively enforces fairness constraints in synthetic data.
Experiments demonstrate improved predictive performance.
Abstract
Differential privacy (DP) enables safe data release, with synthetic data generation emerging as a common approach in recent years. Yet standard synthesizers preserve all dependencies in the data, including spurious correlations between sensitive attributes and outcomes. In fairness-critical settings, this reproduces unwanted bias. A principled remedy is to enforce conditional independence (CI) constraints, which encode domain knowledge or legal requirements that outcomes be independent of sensitive attributes once admissible factors are accounted for. DP synthesis typically proceeds in two phases: (i) a measure- ment step that privatizes selected marginals, often structured via maximum spanning trees (MSTs), and (ii) a reconstruction step that fits a probabilistic model consistent with the noisy marginals. We propose PrivCI, which enforces CI during the measurement step via a CI-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Ethics and Social Impacts of AI · Data Quality and Management
