Support Size Estimation: The Power of Conditioning
Diptarka Chakraborty, Gunjan Kumar, Kuldeep S. Meel

TL;DR
This paper introduces new information-theoretic lower bounds for support size estimation using conditional sampling, significantly narrowing the gap between known upper and lower bounds and extending to models with additional oracle access.
Contribution
It provides the first nearly matching lower bounds for the COND model and its extensions, advancing understanding of the query complexity in support size estimation.
Findings
Nearly matching lower bounds for COND model.
First lower bounds for models with oracle access.
Queries needed grow with log log n and inverse epsilon factors.
Abstract
We consider the problem of estimating the support size of a distribution . Our investigations are pursued through the lens of distribution testing and seek to understand the power of conditional sampling (denoted as COND), wherein one is allowed to query the given distribution conditioned on an arbitrary subset . The primary contribution of this work is to introduce a new approach to lower bounds for the COND model that relies on using powerful tools from information theory and communication complexity. Our approach allows us to obtain surprisingly strong lower bounds for the COND model and its extensions. 1) We bridge the longstanding gap between the upper () and the lower bound for COND model by providing a nearly matching lower bound. Surprisingly, we show that even if we get to know the actual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
