Convergence of online $k$-means

Sanjoy Dasgupta; Gaurav Mahajan; Geelon So

arXiv:2202.10640 · cs.LG · February 23, 2022

Open Access

TL;DR

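This paper proves that a general class of online $k$-means algorithms, run on data streamed from a distribution, converges asymptotically to the set of stationary points of the $k$-means cost function. The proof interprets these algorithms as stochastic gradient descent with a stochastic, center-specific learning rate schedule.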

Contribution

It establishes the convergence of a broad class of online $k$-means algorithms by linking them to stochastic gradient descent and extending existing optimization techniques.

Findings

01

Centers converge to stationary points asymptotically

02

Online $k$-means can be viewed as stochastic gradient descent

03

Convergence holds under adaptive, center-dependent learning rates

Abstract

We prove asymptotic convergence for a general class of $k$-means algorithms performed over streaming data from a distribution: the centers asymptotically converge to the set of stationary points of the $k$-means cost function. To do so, we show that online $k$-means over a distribution can be interpreted as stochastic gradient descent with a stochastic learning rate schedule. Then, we prove convergence by extending techniques used in optimization literature to handle settings where center-specific learning rates may depend on the past trajectory of the centers.
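To make the stochastic gradient descent reading concrete, here is a minimal sketch of one familiar instance of online $k$-means, the MacQueen-style update, in which each center keeps its own learning rate equal to one over the number of points it has absorbed so far. This is an illustrative assumption and not necessarily the exact class of algorithms analyzed in the paper; the function name `online_kmeans` and its parameters are hypothetical. Moving the closest center toward the new point is a stochastic gradient step on the cost $\frac{1}{2}\,\mathbb{E}\min_i \|x - c_i\|^2$, and the learning rate depends on the past trajectory through the per-center counts.

```python
import numpy as np

def online_kmeans(stream, init_centers):
    """MacQueen-style online k-means (illustrative sketch, not the paper's code).

    stream: iterable of points, each of shape (d,)
    init_centers: array-like of shape (k, d)
    Returns the final centers and the number of points assigned to each.
    """
    centers = np.array(init_centers, dtype=float)
    counts = np.zeros(len(centers), dtype=int)
    for x in stream:
        x = np.asarray(x, dtype=float)
        # Assign the point to its closest center (squared Euclidean distance).
        i = int(np.argmin(np.sum((centers - x) ** 2, axis=1)))
        counts[i] += 1
        # Center-specific learning rate: depends on how often center i was
        # updated before, i.e. on the past trajectory of the centers.
        eta = 1.0 / counts[i]
        # SGD step: the stochastic gradient of the cost w.r.t. center i
        # at the sample x is (centers[i] - x).
        centers[i] += eta * (x - centers[i])
    return centers, counts

# Toy one-dimensional stream with two well-separated groups.
centers, counts = online_kmeans(
    stream=[[0.0], [2.0], [10.0], [12.0]],
    init_centers=[[1.0], [11.0]],
)
print(centers.ravel(), counts)
```

With the $1/n_i$ rate, each center equals the running mean of the points assigned to it, which is why the toy run ends with centers at 1 and 11. Other schedules in the same spirit would replace the line computing `eta`.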

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research