Introduction to Coresets: Approximated Mean
Alaa Maalouf, Ibrahim Jubran, Dan Feldman

TL;DR
This survey introduces core concepts, techniques, and algorithms for constructing coresets for mean queries, providing unified analysis, proofs, and open-source code to aid researchers and practitioners in understanding and applying coresets.
Contribution
The paper offers a comprehensive, unified analysis of mean coreset construction methods, including classical and modern results, with detailed proofs and open-source implementations.
Findings
Collected and formalized folklore results
Unified analysis methodology for coreset constructions
Provided open-source code for algorithms
Abstract
A \emph{strong coreset} for the mean queries of a set in is a small weighted subset , which provably approximates its sum of squared distances to any center (point) . A \emph{weak coreset} is (also) a small weighted subset of , whose mean approximates the mean of . While computing the mean of can be easily computed in linear time, its coreset can be used to solve harder constrained version, and is in the heart of generalizations such as coresets for -means clustering. In this paper, we survey most of the mean coreset construction techniques, and suggest a unified analysis methodology for providing and explaining classical and modern results including step-by-step proofs. In particular, we collected folklore and scattered related results, some of which are not formally stated elsewhere. Throughout this survey, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
MethodsCoresets
