Verified Foundations for Differential Privacy
Markus de Medeiros, Muhammad Naveed, Tancr\`ede Lepoint, Temesghen, Kahsai, Tristan Ravitch, Stefan Zetzsche, Anjali Joshi, Joseph Tassarotti,, Aws Albarghouthi, Jean-Baptiste Tristan

TL;DR
This paper introduces SampCert, a comprehensive mechanized foundation for differential privacy in Lean, including verified sampling algorithms, enabling reliable DP implementations in production systems like AWS.
Contribution
It provides the first formal, extensible foundation for differential privacy with verified sampling algorithms, improving correctness and deployment in real-world systems.
Findings
Verified Laplace and Gaussian sampling algorithms
SampCert's foundation supports various DP definitions
Powering AWS's DP offerings with verified primitives
Abstract
Differential privacy (DP) has become the gold standard for privacy-preserving data analysis, but implementing it correctly has proven challenging. Prior work has focused on verifying DP at a high level, assuming the foundations are correct and a perfect source of randomness is available. However, the underlying theory of differential privacy can be very complex and subtle. Flaws in basic mechanisms and random number generation have been a critical source of vulnerabilities in real-world DP systems. In this paper, we present SampCert, the first comprehensive, mechanized foundation for differential privacy. SampCert is written in Lean with over 12,000 lines of proof. It offers a generic and extensible notion of DP, a framework for constructing and composing DP mechanisms, and formally verified implementations of Laplace and Gaussian sampling algorithms. SampCert provides (1) a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
