Private Counting of Distinct Elements in the Turnstile Model and   Extensions

Monika Henzinger; A. R. Sricharan; Teresa Anna Steiner

arXiv:2408.11637·cs.DS·August 22, 2024

Private Counting of Distinct Elements in the Turnstile Model and Extensions

Monika Henzinger, A. R. Sricharan, Teresa Anna Steiner

PDF

TL;DR

This paper advances private algorithms for counting distinct elements in data streams, introducing a simple method with tight error bounds and analyzing privacy guarantees across different models, addressing open questions in the field.

Contribution

It presents a simple, tight-error private algorithm based on the sparse vector technique and extends lower bounds from item-level to event-level differential privacy.

Findings

01

A simple sparse vector-based algorithm achieves tight error bounds.

02

Lower bounds for item-level privacy extend to event-level privacy.

03

Addresses open questions from prior work on privacy models.

Abstract

Privately counting distinct elements in a stream is a fundamental data analysis problem with many applications in machine learning. In the turnstile model, Jain et al. [NeurIPS2023] initiated the study of this problem parameterized by the maximum flippancy of any element, i.e., the number of times that the count of an element changes from 0 to above 0 or vice versa. They give an item-level $(ϵ, δ)$ -differentially private algorithm whose additive error is tight with respect to that parameterization. In this work, we show that a very simple algorithm based on the sparse vector technique achieves a tight additive error for item-level $(ϵ, δ)$ -differential privacy and item-level $ϵ$ -differential privacy with regards to a different parameterization, namely the sum of all flippancies. Our second result is a bound which shows that for a large class of algorithms,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.