Growing Alphabets in Canonical Shuffle Experiments: Likelihood-Ratio Laws, Estimation Bounds, and Low-Budget Equivariant Design
Alex Shvets

TL;DR
This paper analyzes privacy-preserving shuffle experiments for finite-output channels, establishing laws, bounds, and optimal mechanisms for frequency estimation under local differential privacy constraints.
Contribution
It introduces a likelihood-ratio law governing shuffle experiments, proves a sharp endpoint principle, and identifies optimal mechanisms in low-budget regimes.
Findings
Alphabet growth enhances privacy iff the worst pairwise law collapses to delta_1.
A sharp pure-LDP endpoint principle for pairwise chi-squared is established.
Augmented GRR is optimal among permutation-equivariant channels in low-budget regimes.
Abstract
We study canonical one-step neighboring shuffle experiments for finite-output epsilon_0-LDP d-ary channels along growing alphabets, with frequency estimation and mechanism design under a pairwise chi-squared budget. The pairwise likelihood-ratio law nu_{ab,d} (pushforward of the row ratio under the null row) is the governing invariant: the canonical shuffled histogram experiment is exactly equivalent to the quotient multinomial experiment generated by nu_{ab,d}. Alphabet growth improves canonical shuffled privacy iff the worst pairwise law collapses to delta_1. We prove a sharp pure-LDP endpoint principle for the pairwise chi-squared, construct full-support obstruction families saturating it, and establish a diluting/persistent dichotomy with explicit finite-n hockey-stick bounds. The worst-case pairwise budget chi_*(W) governs a two-regime Assouad lower bound for arbitrary estimators…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
