Local Search k-means++ with Foresight
Theo Conrads, Lukas Drexler, Joshua K\"onen, Daniel R. Schmidt,, Melanie Schmidt

TL;DR
This paper introduces Foresight LS++, a new local search enhancement for $k$-means++ clustering that improves solution quality while maintaining theoretical guarantees and runtime efficiency.
Contribution
The paper proposes Foresight LS++, a novel algorithm that combines local search with Lloyd's algorithm to enhance $k$-means++ clustering performance.
Findings
Foresight LS++ outperforms LS++ in solution quality in experiments.
Foresight LS++ retains the same runtime and approximation bounds as LS++.
Combining LS++ with greedy $k$-means++ improves practical results despite worse theoretical guarantees.
Abstract
Since its introduction in 1957, Lloyd's algorithm for -means clustering has been extensively studied and has undergone several improvements. While in its original form it does not guarantee any approximation factor at all, Arthur and Vassilvitskii (SODA 2007) proposed -means++ which enhances Lloyd's algorithm by a seeding method which guarantees a -approximation in expectation. More recently, Lattanzi and Sohler (ICML 2019) proposed LS++ which further improves the solution quality of -means++ by local search techniques to obtain a -approximation. On the practical side, the greedy variant of -means++ is often used although its worst-case behaviour is provably worse than for the standard -means++ variant. We investigate how to improve LS++ further in practice. We study two options for improving the practical performance: (a) Combining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Data Mining Algorithms and Applications · Machine Learning and Data Classification
