Estimating the Effective Support Size in Constant Query Complexity
Shyam Narayanan, Jakub T\v{e}tek

TL;DR
This paper presents a simple, constant-query complexity algorithm for estimating the effective support size of a distribution, improving upon previous methods that required support-size-dependent queries.
Contribution
It introduces a novel algorithm that achieves constant-query complexity for estimating the effective support size, removing the need for bicriteria relaxation.
Findings
Algorithm with $O(1/eta^3 \epsilon^3)$ query complexity
Achieves support size estimation within a factor of $(1+eta)$ and $ ext{Ess}_\epsilon$
Simpler pseudocode with only 4 lines
Abstract
Estimating the support size of a distribution is a well-studied problem in statistics. Motivated by the fact that this problem is highly non-robust (as small perturbations in the distributions can drastically affect the support size) and thus hard to estimate, Goldreich [ECCC 2019] studied the query complexity of estimating the -\emph{effective support size} of a distribution , which is equal to the smallest support size of a distribution that is -far in total variation distance from . In his paper, he shows an algorithm in the dual access setting (where we may both receive random samples and query the sampling probability for any ) for a bicriteria approximation, giving an answer in for some values . However, his algorithm has either…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Markov Chains and Monte Carlo Methods · Bayesian Modeling and Causal Inference
