On Identifying a Massive Number of Distributions
Sara Shahi, Daniela Tuninetti, Natasha Devroye

TL;DR
This paper investigates the problem of identifying numerous underlying distributions from observed sequences, providing asymptotic bounds on the probability of error as both the number of distributions and sequences grow with the observation length.
Contribution
It introduces asymptotic bounds on distribution identification error when the number of distributions and sequences increases with blocklength.
Findings
Derived asymptotic upper bounds on error probability.
Established matching lower bounds on identification accuracy.
Abstract
Finding the underlying probability distributions of a set of observed sequences under the constraint that each sequence is generated i.i.d by a distinct distribution is considered. The number of distributions, and hence the number of observed sequences, are let to grow with the observation blocklength . Asymptotically matching upper and lower bounds on the probability of error are derived.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
