Privately Releasing Conjunctions and the Statistical Query Barrier
Anupam Gupta, Moritz Hardt, Aaron Roth, Jonathan Ullman

TL;DR
This paper explores the complexity of privately releasing answers to conjunctions and statistical queries, establishing fundamental bounds and providing efficient algorithms for certain classes, advancing privacy-preserving data analysis.
Contribution
It characterizes the query complexity in terms of agnostic learning and introduces efficient algorithms for submodular answer classes, including the first private release for Boolean conjunctions.
Findings
Query complexity matches agnostic learning complexity in the SQ model.
Efficient private algorithms exist for submodular answer classes like graph cuts.
First differentially private algorithm for Boolean conjunctions with 1% error.
Abstract
Suppose we would like to know all answers to a set of statistical queries C on a data set up to small error, but we can only access the data itself using statistical queries. A trivial solution is to exhaustively ask all queries in C. Can we do any better? + We show that the number of statistical queries necessary and sufficient for this task is---up to polynomial factors---equal to the agnostic learning complexity of C in Kearns' statistical query (SQ) model. This gives a complete answer to the question when running time is not a concern. + We then show that the problem can be solved efficiently (allowing arbitrary error on a small fraction of queries) whenever the answers to C can be described by a submodular function. This includes many natural concept classes, such as graph cuts and Boolean disjunctions and conjunctions. While interesting from a learning theoretic point of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
