DP-S4S: Accurate and Scalable Select-Join-Aggregate Query Processing with User-Level Differential Privacy
Yuan Qiu, Xiaokui Xiao, Yin Yang

TL;DR
This paper introduces DP-S4S, a scalable and accurate method for user-level differentially private select-join-aggregate queries that leverages sampling of aggregation units and RDP composition to improve efficiency and utility.
Contribution
DP-S4S is a novel mechanism that samples aggregation units and uses RDP for better composition, addressing scalability and accuracy issues in private SJA query processing.
Findings
DP-S4S achieves high utility on large datasets with user-level DP.
DP-S4S reduces computational overhead compared to existing methods.
Experimental results show DP-S4S outperforms prior solutions in accuracy and scalability.
Abstract
Answering Select-Join-Aggregate queries with DP is a fundamental problem with important applications in various domains. The current SOTA methods ensure user-level DP (i.e., the adversary cannot infer the presence or absence of any given individual user with high confidence) and achieve instance-optimal accuracy on the query results. However, these solutions involve solving expensive optimization programs, which may incur prohibitive computational overhead for large databases. One promising direction to achieve scalability is through sampling, which provides a tunable trade-off between result utility and computational costs. However, applying sampling to differentially private SJA processing is a challenge for two reasons. First, it is unclear what to sample, in order to achieve the best accuracy within a given computational budget. Second, prior solutions were not designed with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Data Management and Algorithms
