$t$-$k$-means: A Robust and Stable $k$-means Variant

Yiming Li; Yang Zhang; Qingtao Tang; Weipeng Huang; Yong Jiang,; Shu-Tao Xia

arXiv:1907.07442·cs.LG·February 2, 2021·1 cites

$t$-$k$-means: A Robust and Stable $k$-means Variant

Yiming Li, Yang Zhang, Qingtao Tang, Weipeng Huang, Yong Jiang,, Shu-Tao Xia

PDF

Open Access 1 Repo

TL;DR

The paper introduces t-k-means, a robust and stable variant of the classic k-means clustering algorithm, designed to handle heavy-tailed data and outliers more effectively while improving stability.

Contribution

It proposes a new t-k-means algorithm with theoretical analysis of robustness and stability, along with a fast version to enhance performance.

Findings

01

t-k-means outperforms standard k-means on heavy-tailed data

02

The method demonstrates improved stability with lower variance in results

03

Experiments confirm the effectiveness and efficiency of t-k-means

Abstract

$k$ -means algorithm is one of the most classical clustering methods, which has been widely and successfully used in signal processing. However, due to the thin-tailed property of the Gaussian distribution, $k$ -means algorithm suffers from relatively poor performance on the dataset containing heavy-tailed data or outliers. Besides, standard $k$ -means algorithm also has relatively weak stability, $i . e .$ its results have a large variance, which reduces its credibility. In this paper, we propose a robust and stable $k$ -means variant, dubbed the $t$ - $k$ -means, as well as its fast version to alleviate those problems. Theoretically, we derive the $t$ - $k$ -means and analyze its robustness and stability from the aspect of the loss function and the expression of the clustering center, respectively. Extensive experiments are also conducted, which verify the effectiveness and efficiency of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

THUYimingLi/t-k-means
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Advanced Clustering Algorithms Research · Face and Expression Recognition