Adaptive Threshold Sampling

Daniel Ting

arXiv:1708.04970·stat.ML·June 16, 2022

Adaptive Threshold Sampling

Daniel Ting

PDF

Open Access

TL;DR

This paper introduces a flexible framework for adaptive threshold sampling that dynamically adjusts sampling probabilities, simplifying implementation and enabling new sampling strategies for various complex data analysis tasks.

Contribution

It presents a novel framework for adaptive threshold sampling that handles dependence among samples and applies to diverse problems like top-K and stratified sampling.

Findings

01

Framework simplifies adaptive sampling implementation.

02

New algorithms for top-K and stratified sampling.

03

Addresses dependence issues in adaptive sampling.

Abstract

Sampling is a fundamental problem in computer science and statistics. However, for a given task and stream, it is often not possible to choose good sampling probabilities in advance. We derive a general framework for adaptively changing the sampling probabilities via a collection of thresholds.In general, adaptive sampling procedures introduce dependence amongst the sampled points, making it difficult to compute expectations and ensure estimators are unbiased or consistent. Our framework address this issue and further shows when adaptive thresholds can be treated as if they were fixed thresholds which samples items independently. This makes our adaptive sampling schemes simple to apply as there is no need to create custom estimators for the sampling method. Using our framework, we derive new samplers that can address a broad range of new and existing problems including sampling with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Adversarial Robustness in Machine Learning · Mobile Crowdsensing and Crowdsourcing