Instance-Optimality in I/O-Efficient Sampling and Sequential Estimation
Shyam Narayanan, V\'aclav Rozho\v{n}, Jakub T\v{e}tek, Mikkel Thorup

TL;DR
This paper develops instance-optimal algorithms for I/O-efficient sampling and estimation tasks, providing guarantees that adapt to the input's structure without assumptions, and extends these results to various statistical estimation problems.
Contribution
It introduces instance-optimal, order-oblivious algorithms for I/O-efficient sampling and estimation, with broad applications in statistical and distributional analysis.
Findings
Algorithms are instance-optimal among order-oblivious methods.
Results apply to estimating mean, histograms, quantiles, and distribution functions.
Provides non-parametric, instance-optimal bounds for several fundamental problems.
Abstract
Suppose we have a memory storing s and s and we want to estimate the frequency of s by sampling. We want to do this I/O-efficiently, exploiting that each read gives a block of bits at unit cost; not just one bit. If the input consists of uniform blocks: either all 1s or all 0s, then sampling a whole block at a time does not reduce the number of samples needed for estimation. On the other hand, if bits are randomly permuted, then getting a block of bits is as good as getting independent bit samples. However, we do not want to make any such assumptions on the input. Instead, our goal is to have an algorithm with instance-dependent performance guarantees which stops sampling blocks as soon as we know that we have a probabilistically reliable estimate. We prove our algorithms to be instance-optimal among algorithms oblivious to the order of the blocks, which we argue is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Process Monitoring · Industrial Vision Systems and Defect Detection
