Ranking Vectors Clustering: Theory and Applications
Ali Fattahi, Ali Eshragh, Babak Aslani, Meysam Rabiee

TL;DR
This paper investigates clustering ranking vectors, introduces an NP-hard problem, and proposes efficient algorithms with theoretical guarantees, validated by experiments on synthetic and real data.
Contribution
It formulates the k-centroids ranking vectors clustering problem, proves its NP-hardness, and develops approximation and exact algorithms with theoretical bounds.
Findings
KRCA outperforms baseline solutions in solution quality
Algorithms are computationally efficient for large datasets
Theoretical error bounds are established for proposed methods
Abstract
We study the problem of clustering ranking vectors, where each vector represents preferences as an ordered list of distinct integers. Specifically, we focus on the k-centroids ranking vectors clustering problem (KRC), which aims to partition a set of ranking vectors into k clusters and identify the centroid of each cluster. Unlike classical k-means clustering (KMC), KRC constrains both the observations and centroids to be ranking vectors. We establish the NP-hardness of KRC and characterize its feasible set. For the single-cluster case, we derive a closed-form analytical solution for the optimal centroid, which can be computed in linear time. To address the computational challenges of KRC, we develop an efficient approximation algorithm, KRCA, which iteratively refines initial solutions from KMC, referred to as the baseline solution. Additionally, we introduce a branch-and-bound (BnB)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Bayesian Methods and Mixture Models · Data Management and Algorithms
