On Convergence of Epanechnikov Mean Shift

Kejun Huang; Xiao Fu; Nicholas D. Sidiropoulos

arXiv:1711.07441·stat.ML·November 21, 2017·2 cites

On Convergence of Epanechnikov Mean Shift

Kejun Huang, Xiao Fu, Nicholas D. Sidiropoulos

PDF

Open Access

TL;DR

This paper provides a theoretical analysis of Epanechnikov Mean Shift, proposes fixes for convergence issues, and demonstrates its effectiveness for clustering large, high-dimensional data sets with better performance than traditional algorithms.

Contribution

It offers the first convergence guarantees for Epanechnikov Mean Shift and introduces practical modifications for large-scale clustering tasks.

Findings

01

Guaranteed convergence to local maxima of the density estimate.

02

Effective clustering performance on large, high-dimensional data.

03

Outperforms Lloyd's K-means and EM algorithms in experiments.

Abstract

Epanechnikov Mean Shift is a simple yet empirically very effective algorithm for clustering. It localizes the centroids of data clusters via estimating modes of the probability distribution that generates the data points, using the `optimal' Epanechnikov kernel density estimator. However, since the procedure involves non-smooth kernel density functions, the convergence behavior of Epanechnikov mean shift lacks theoretical support as of this writing---most of the existing analyses are based on smooth functions and thus cannot be applied to Epanechnikov Mean Shift. In this work, we first show that the original Epanechnikov Mean Shift may indeed terminate at a non-critical point, due to the non-smoothness nature. Based on our analysis, we propose a simple remedy to fix it. The modified Epanechnikov Mean Shift is guaranteed to terminate at a local maximum of the estimated density, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Face and Expression Recognition