Minimax optimal differentially private synthetic data for smooth queries
Rundong Ding, Yiyun He, Yizhe Zhu

TL;DR
This paper introduces a minimax optimal algorithm for generating differentially private synthetic data that provides strong utility guarantees for smooth queries, improving over prior bounds by exploiting smoothness structure.
Contribution
It proposes a polynomial-time method achieving optimal error rates for smooth queries, generalizing previous frameworks and establishing the first minimax lower bounds for this setting.
Findings
Achieves minimax error rate of n^{-min{1,k/d}} for smooth queries
Uncovers a phase transition at k=d in utility guarantees
Improves upon previous error bounds for smooth query privacy
Abstract
Differentially private synthetic data enables the sharing and analysis of sensitive datasets while providing rigorous privacy guarantees for individual contributors. A central challenge is to achieve strong utility guarantees for meaningful downstream analysis. Many existing methods ensure uniform accuracy over broad query classes, such as all Lipschitz functions, but this level of generality often leads to suboptimal rates for statistics of practical interest. Since many common data analysis queries exhibit smoothness beyond what worst-case Lipschitz bounds capture, we ask whether exploiting this additional structure can yield improved utility. We study the problem of generating -differentially private synthetic data from a dataset of size supported on the hypercube , with utility guarantees uniformly for all smooth queries having bounded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Markov Chains and Monte Carlo Methods
