Approximating k-Center via Farthest-First on $\delta$-Covers
Jason R. Wilson

TL;DR
This paper explores using $ ext{delta}$-covers to scale the farthest-first algorithm for the $k$-center problem, showing it maintains near-optimal solutions with reduced computational effort on large datasets.
Contribution
It proves that applying farthest-first on a $ ext{delta}$-cover yields a solution within twice the optimal radius plus $ ext{delta}$, enabling scalable clustering.
Findings
Significant reduction in running time on large high-dimensional datasets.
Modest increase in the $k$-center radius when using $ ext{delta}$-covers.
Theoretical guarantee of solution quality when using $ ext{delta}$-covers.
Abstract
The farthest-first traversal of Gonzalez is a classical -approximation algorithm for solving the -center problem, but its sequential nature makes it difficult to scale to very large datasets. In this work we study the effect of running farthest-first on a -cover of the dataset rather than on the full set of points. A -cover provides a compact summary of the data in which every point lies within distance of some selected center. We prove that if farthest-first is applied to a -cover, the resulting -center radius is at most twice the optimal radius plus . In our experiments on large high-dimensional datasets, we show that restricting the input to a -cover dramatically reduces the running time of the farthest-first traversal while only modestly increasing the -center radius.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Computational Geometry and Mesh Generation · Facility Location and Emergency Management
