Improved Coresets for Clustering with Capacity and Fairness Constraints
Lingxiao Huang, Pinyan Lu, Xuan Wu

TL;DR
This paper presents an improved algorithm for constructing small coresets for capacitated and fair clustering problems, reducing computational complexity and coreset size compared to previous methods.
Contribution
It introduces a near-linear time algorithm with smaller coreset sizes for capacitated and fair clustering, improving hierarchical sampling by adaptive selection based on clustering cost.
Findings
Reduces coreset size for capacitated clustering by a factor of $k \\varepsilon^{-z}$.
Achieves near-linear time construction of $ ilde{O}(k^2\\\varepsilon^{-2z-2})$-sized coresets.
Simplifies analysis by decreasing the effective centers from $ ilde{O}(k^2\\\varepsilon^{-z})$ to $O(k \\\log \\\varepsilon^{-1})$.
Abstract
We study coresets for clustering with capacity and fairness constraints. Our main result is a near-linear time algorithm to construct -sized -coresets for capacitated -clustering which improves a recent bound by [BCAJ+22, HJLW23]. As a corollary, we also save a factor of on the coreset size for fair -clustering compared to them. We fundamentally improve the hierarchical uniform sampling framework of [BCAJ+22] by adaptively selecting sample size on each ring instance, proportional to its clustering cost to an optimal solution. Our analysis relies on a key geometric observation that reduces the number of total ``effective centers" from [BCAJ+22]'s to merely by being able to ``ignore'' all center points that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Face and Expression Recognition
