DPSQL+: A Differentially Private SQL Library with a Minimum Frequency Rule
Tomoya Matsumoto, Shokichi Takakura, Shun Takagi, Satoshi Hasegawa

TL;DR
DPSQL+ is a modular, privacy-preserving SQL library that enforces differential privacy and a minimum frequency rule, enabling practical, accurate data analysis while maintaining strong privacy guarantees.
Contribution
It introduces a novel architecture combining query validation, privacy accounting, and database interfacing to enforce DP and minimum frequency rules in SQL queries.
Findings
Achieves practical accuracy on TPC-H benchmarks for various SQL operations.
Allows more queries under a fixed privacy budget compared to prior solutions.
Successfully enforces minimum frequency rule alongside differential privacy.
Abstract
SQL is the de facto interface for exploratory data analysis; however, releasing exact query results can expose sensitive information through membership or attribute inference attacks. Differential privacy (DP) provides rigorous privacy guarantees, but in practice, DP alone may not satisfy governance requirements such as the \emph{minimum frequency rule}, which requires each released group (cell) to include contributions from at least distinct individuals. In this paper, we present \textbf{DPSQL+}, a privacy-preserving SQL library that simultaneously enforces user-level -DP and the minimum frequency rule. DPSQL+ adopts a modular architecture consisting of: (i) a \emph{Validator} that statically restricts queries to a DP-safe subset of SQL; (ii) an \emph{Accountant} that consistently tracks cumulative privacy loss across multiple queries; and (iii) a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
