
TL;DR
This paper introduces multi-objective weighted sampling, a technique that efficiently summarizes large data sets to estimate multiple statistics simultaneously with guaranteed accuracy, applicable to key-value data and metric spaces.
Contribution
The paper develops new sampling schemes and algorithms for multi-objective samples, enabling accurate estimation of various functions across different data domains with minimal sample sizes.
Findings
Multi-objective samples provide simultaneous guarantees for multiple statistics.
Sampling schemes are efficient and produce small, scalable summaries.
Applications include key-value statistics and metric space costs.
Abstract
{\em Multi-objective samples} are powerful and versatile summaries of large data sets. For a set of keys and associated values , a weighted sample taken with respect to allows us to approximate {\em segment-sum statistics} , for any subset of the keys, with statistically-guaranteed quality that depends on sample size and the relative weight of . When estimating for , however, quality guarantees are lost. A multi-objective sample with respect to a set of functions provides for each the same statistical guarantees as a dedicated weighted sample while minimizing the summary size. We analyze properties of multi-objective samples and present sampling schemes and meta-algortithms for estimation and optimization while showcasing two important application domains. The first are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
