Structure Selection for Fairness-Constrained Differentially Private Data Synthesis

Naeim Ghahramanpour; Mostafa Milani

arXiv:2603.12112·cs.DB·March 26, 2026

Structure Selection for Fairness-Constrained Differentially Private Data Synthesis

Naeim Ghahramanpour, Mostafa Milani

PDF

Open Access

TL;DR

This paper introduces PrivCI, a method for privacy-preserving data synthesis that enforces fairness constraints by integrating conditional independence into the measurement process, leading to more accurate and fair synthetic data.

Contribution

PrivCI is the first approach to incorporate conditional independence constraints directly into the measurement phase of differentially private data synthesis using a CI-aware greedy MST algorithm.

Findings

01

PrivCI outperforms existing methods in fidelity and accuracy.

02

It effectively enforces fairness constraints in synthetic data.

03

Experiments demonstrate improved predictive performance.

Abstract

Differential privacy (DP) enables safe data release, with synthetic data generation emerging as a common approach in recent years. Yet standard synthesizers preserve all dependencies in the data, including spurious correlations between sensitive attributes and outcomes. In fairness-critical settings, this reproduces unwanted bias. A principled remedy is to enforce conditional independence (CI) constraints, which encode domain knowledge or legal requirements that outcomes be independent of sensitive attributes once admissible factors are accounted for. DP synthesis typically proceeds in two phases: (i) a measure- ment step that privatizes selected marginals, often structured via maximum spanning trees (MSTs), and (ii) a reconstruction step that fits a probabilistic model consistent with the noisy marginals. We propose PrivCI, which enforces CI during the measurement step via a CI-aware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Ethics and Social Impacts of AI · Data Quality and Management