Improved analysis of D2-sampling based PTAS for k-means and other   Clustering problems

Ragesh Jaiswal; Mehul Kumar; Pulkit Yadav

arXiv:1401.3685·cs.DS·January 16, 2014·1 cites

Improved analysis of D2-sampling based PTAS for k-means and other Clustering problems

Ragesh Jaiswal, Mehul Kumar, Pulkit Yadav

PDF

Open Access

TL;DR

This paper improves the theoretical analysis of a D^2-sampling based PTAS for k-means clustering, reducing its running time from exponential in k^2 to exponential in k, making it more efficient.

Contribution

The authors provide a tighter analysis that significantly reduces the running time of the existing PTAS for k-means clustering.

Findings

01

Running time improved from $O(nd imes 2^{ ilde{O}(k^2/\epsilon)})$ to $O(nd imes 2^{ ilde{O}(k/\epsilon)})$.

02

Analysis enhances understanding of D^2-sampling efficiency.

03

Potential for faster clustering algorithms in high-dimensional data.

Abstract

We give an improved analysis of the simple $D^{2}$ -sampling based PTAS for the $k$ -means clustering problem given by Jaiswal, Kumar, and Sen (Algorithmica, 2013). The improvement on the running time is from $O (n d \cdot 2^{\tilde{O} (k^{2} / ϵ)})$ to $O (n d \cdot 2^{\tilde{O} (k / ϵ)})$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Distributed Sensor Networks and Detection Algorithms · Advanced Clustering Algorithms Research