Nonparametric Clustering Stopping Rule Based on Multivariate Median
Hend Gabr, Brian H Willis, and Mohammed Baragilly

TL;DR
This paper presents a new nonparametric method based on the multivariate median for determining the optimal number of clusters, balancing homogeneity and heterogeneity, and demonstrating robustness and superior performance in various datasets.
Contribution
The paper introduces a distribution-free clustering stopping rule using the spatial median, improving robustness and accuracy over existing methods.
Findings
Outperforms 13 established algorithms in 11 comparisons
Robust to outliers and distribution-free
Validated through extensive simulations and real data applications
Abstract
This paper introduces a novel nonparametric criterion for determining the appropriate number of clusters, which is derived from the spatial median. The method is constructed to reconcile two competing objectives of cluster analysis: the preservation of internal homogeneity within clusters and the maximization of heterogeneity across clusters. To this end, the proposed algorithm optimizes the ratio of inter-cluster to intra-cluster variability, incorporating adjustments for both the sample size and the number of clusters. Unlike conventional techniques, the method is distribution-free and demonstrates robustness in the presence of outliers. Its properties were first examined through extensive simulation studies, followed by empirical evaluations on three applied datasets. To further assess comparative performance, the proposed procedure was benchmarked against 13 established algorithms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Bayesian Methods and Mixture Models · Advanced Clustering Algorithms Research
