
TL;DR
This paper introduces a flexible framework for adaptive threshold sampling that dynamically adjusts sampling probabilities, simplifying implementation and enabling new sampling strategies for various complex data analysis tasks.
Contribution
It presents a novel framework for adaptive threshold sampling that handles dependence among samples and applies to diverse problems like top-K and stratified sampling.
Findings
Framework simplifies adaptive sampling implementation.
New algorithms for top-K and stratified sampling.
Addresses dependence issues in adaptive sampling.
Abstract
Sampling is a fundamental problem in computer science and statistics. However, for a given task and stream, it is often not possible to choose good sampling probabilities in advance. We derive a general framework for adaptively changing the sampling probabilities via a collection of thresholds.In general, adaptive sampling procedures introduce dependence amongst the sampled points, making it difficult to compute expectations and ensure estimators are unbiased or consistent. Our framework address this issue and further shows when adaptive thresholds can be treated as if they were fixed thresholds which samples items independently. This makes our adaptive sampling schemes simple to apply as there is no need to create custom estimators for the sampling method. Using our framework, we derive new samplers that can address a broad range of new and existing problems including sampling with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Adversarial Robustness in Machine Learning · Mobile Crowdsensing and Crowdsourcing
