Dynamic Sampling from a Discrete Probability Distribution with a Known Distribution of Rates
Federico D'Ambrosio, Hans L. Bodlaender, Gerard T. Barkema

TL;DR
This paper develops efficient data structures for dynamic sampling from large discrete distributions with known rate bounds, achieving near-constant expected sampling and update times under certain conditions.
Contribution
It introduces novel combinations of basic data structures for dynamic sampling, providing theoretical bounds and practical implementation insights.
Findings
Expected sampling and update time can be constant under specific rate distribution conditions.
Combining tree structures with Acceptance-Rejection yields $O(rac{ ext{log log r_{max}}}{r_{min}})$ expected time.
Experimental results validate theoretical bounds and discuss real-world constraints.
Abstract
In this paper, we consider several efficient data structures for the problem of sampling from a dynamically changing discrete probability distribution, where some prior information is known on the distribution of the rates, in particular the maximum and minimum rate, and where the number of possible outcomes N is large. We consider three basic data structures, the Acceptance-Rejection method, the Complete Binary Tree and the Alias method. These can be used as building blocks in a multi-level data structure, where at each of the levels, one of the basic data structures can be used, with the top level selecting a group of events, and the bottom level selecting an element from a group. Depending on assumptions on the distribution of the rates of outcomes, different combinations of the basic structures can be used. We prove that for particular data structures the expected time of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Scientific Computing and Data Management · Statistical Methods in Clinical Trials
