Convergence of online $k$-means
Sanjoy Dasgupta, Gaurav Mahajan, Geelon So

TL;DR
This paper proves that online $k$-means algorithms converge asymptotically to stationary points when performed on streaming data, by interpreting them as stochastic gradient descent with adaptive learning rates.
Contribution
It establishes the convergence of a broad class of online $k$-means algorithms by linking them to stochastic gradient descent and extending existing optimization techniques.
Findings
Centers converge to stationary points asymptotically
Online $k$-means can be viewed as stochastic gradient descent
Convergence holds under adaptive, center-dependent learning rates
Abstract
We prove asymptotic convergence for a general class of -means algorithms performed over streaming data from a distribution: the centers asymptotically converge to the set of stationary points of the -means cost function. To do so, we show that online -means over a distribution can be interpreted as stochastic gradient descent with a stochastic learning rate schedule. Then, we prove convergence by extending techniques used in optimization literature to handle settings where center-specific learning rates may depend on the past trajectory of the centers.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research
