A Tight VC-Dimension Analysis of Clustering Coresets with Applications
Vincent Cohen-Addad, Andrew Draganov, Matteo Russo, David, Saulpic, Chris Schwiegelshohn

TL;DR
This paper provides a sharp VC-dimension based analysis for constructing coresets in k-clustering problems, leading to improved bounds for specific metrics like shortest path in planar graphs and polygonal curves.
Contribution
It introduces a novel VC-dimension analysis approach that yields tighter coreset size bounds for various clustering metrics compared to previous results.
Findings
Improved coreset bounds for shortest path metrics in planar graphs.
Enhanced coreset sizes for clustering polygonal curves under Frechet metrics.
Theoretical advancement in coreset construction using VC-dimension analysis.
Abstract
We consider coresets for -clustering problems, where the goal is to assign points to centers minimizing powers of distances. A popular example is the -median objective . Given a point set , a coreset is a small weighted subset that approximates the cost of for all candidate solutions up to a multiplicative factor. In this paper, we give a sharp VC-dimension based analysis for coreset construction. As a consequence, we obtain improved -median coreset bounds for the following metrics: Coresets of size for shortest path metrics in planar graphs, improving over the bounds by [Cohen-Addad, Saulpic, Schwiegelshohn, STOC'21] and by [Braverman, Jiang, Krauthgamer, Wu, SODA'21].…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training · Coresets
