Expectation Distance-based Distributional Clustering for Noise-Robustness
Rahmat Adesunkanmi, Ratnesh Kumar

TL;DR
This paper introduces a noise-robust clustering method based on a new expectation distance that considers joint distributions, extending classical algorithms to operate on data-distributions, and demonstrates improved accuracy and efficiency on real-world datasets.
Contribution
It proposes a novel expectation distance for distributional clustering, extending K-means and K-medoids to operate on data-distributions, with closed-form solutions and demonstrated superior performance.
Findings
Higher clustering accuracy with expectation distance (ED) over traditional methods.
Reduced computation time due to lower complexity.
Effective clustering of weather and stock data using distribution-based methods.
Abstract
This paper presents a clustering technique that reduces the susceptibility to data noise by learning and clustering the data-distribution and then assigning the data to the cluster of its distribution. In the process, it reduces the impact of noise on clustering results. This method involves introducing a new distance among distributions, namely the expectation distance (denoted, ED), that goes beyond the state-of-art distribution distance of optimal mass transport (denoted, for -Wasserstein): The latter essentially depends only on the marginal distributions while the former also employs the information about the joint distributions. Using the ED, the paper extends the classical -means and -medoids clustering to those over data-distributions (rather than raw-data) and introduces -medoids using . The paper also presents the closed-form expressions of the and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Anomaly Detection Techniques and Applications · Automated Road and Building Extraction
