Robust Trimmed k-means
Olga Dorabiala, J. Nathan Kutz, Aleksandr Aravkin

TL;DR
Robust Trimmed k-means (RTKM) is a new clustering algorithm that effectively identifies outliers and clusters data in both single- and multi-membership scenarios, improving robustness over existing methods.
Contribution
The paper introduces RTKM, a novel extension of k-means that handles outliers and multi-membership data simultaneously, outperforming existing robust clustering algorithms.
Findings
RTKM performs well on single membership data with outliers.
RTKM effectively clusters multi-membership data without outliers.
RTKM outperforms other methods on multi-membership data with outliers.
Abstract
Clustering is a fundamental tool in unsupervised learning, used to group objects by distinguishing between similar and dissimilar features of a given data set. One of the most common clustering algorithms is k-means. Unfortunately, when dealing with real-world data many traditional clustering algorithms are compromised by lack of clear separation between groups, noisy observations, and/or outlying data points. Thus, robust statistical algorithms are required for successful data analytics. Current methods that robustify k-means clustering are specialized for either single or multi-membership data, but do not perform competitively in both cases. We propose an extension of the k-means algorithm, which we call Robust Trimmed k-means (RTKM) that simultaneously identifies outliers and clusters points and can be applied to either single- or multi-membership data. We test RTKM on various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Clustering Algorithms Research · Face and Expression Recognition
Methodsk-Means Clustering
