On Identifying a Massive Number of Distributions

Sara Shahi; Daniela Tuninetti; Natasha Devroye

arXiv:1801.04593·cs.IT·October 16, 2018

On Identifying a Massive Number of Distributions

Sara Shahi, Daniela Tuninetti, Natasha Devroye

PDF

TL;DR

This paper investigates the problem of identifying numerous underlying distributions from observed sequences, providing asymptotic bounds on the probability of error as both the number of distributions and sequences grow with the observation length.

Contribution

It introduces asymptotic bounds on distribution identification error when the number of distributions and sequences increases with blocklength.

Findings

01

Derived asymptotic upper bounds on error probability.

02

Established matching lower bounds on identification accuracy.

Abstract

Finding the underlying probability distributions of a set of observed sequences under the constraint that each sequence is generated i.i.d by a distinct distribution is considered. The number of distributions, and hence the number of observed sequences, are let to grow with the observation blocklength $n$ . Asymptotically matching upper and lower bounds on the probability of error are derived.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.