Characterizing the Distinguishability of Product Distributions through Multicalibration
Cassandra Marcussen, Aaron Putterman, Salil Vadhan

TL;DR
This paper establishes a new framework linking the computational indistinguishability of product distributions to their information-theoretic properties, providing tight bounds on the number of samples needed for efficient distinction.
Contribution
It introduces a reduction method that relates computational indistinguishability to information-theoretic measures, leading to a tight characterization of sample complexity for distinguishing product distributions.
Findings
Derived a tight bound for sample complexity based on Hellinger distance
Re-derives and extends previous results on indistinguishability scaling
Provides a general framework applicable to arbitrary product distributions
Abstract
Given a sequence of samples promised to be drawn from one of two distributions , a well-studied problem in statistics is to decide distribution the samples are from. Information theoretically, the maximum advantage in distinguishing the two distributions given samples is captured by the total variation distance between and . However, when we restrict our attention to (i.e., small circuits) of these two distributions, exactly characterizing the ability to distinguish and is more involved and less understood. In this work, we give a general way to reduce bounds on the computational indistinguishability of and to bounds on the indistinguishability of some specific, related variables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
