DIPS: Optimal Dynamic Index for Poisson $\boldsymbol{\pi}$ps Sampling
Jinchao Huang, Sibo Wang

TL;DR
This paper introduces a dynamic index for Poisson $oldsymbol{ ext{pi}}$ps sampling that supports constant-time queries and updates, enabling efficient handling of dynamic data with practical applications in data mining.
Contribution
We propose a novel dynamic index tailored for Poisson $ ext{pi}$ps sampling, achieving optimal expected query and update times with a recursive partitioning approach.
Findings
Supports $ ext{O}(1)$ expected query and update time
Achieves $ ext{O}(n)$ space complexity
Demonstrates significant speedups in empirical evaluations
Abstract
This paper addresses the Poisson ps sampling problem, a topic of significant academic interest in various domains and with practical data mining applications, such as influence maximization. The problem includes a set of elements, where each element is assigned a weight reflecting its importance. The goal is to generate a random subset of , where each element is included in independently with probability , where is a constant. The subsets must be independent across different queries. While the Poisson ps sampling problem can be reduced to the well-studied subset sampling problem, updates in Poisson ps sampling, such as adding a new element or removing an element, would cause the probabilities of all elements to change in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models
