
TL;DR
This paper introduces pseudo-centroid clustering, a novel approach that replaces traditional centroids with pseudo-centroids suitable for non-numeric data, enabling clustering in social sciences and related fields.
Contribution
It formulates a K-PC clustering algorithm with MinMax and MinSum pseudo-centroids, along with initialization and diversification methods, extending clustering techniques to non-numeric characteristic spaces.
Findings
Developed K-PC algorithms for diverse data types.
Proved theorems for optimal cluster size selection.
Introduced a Regret-Threshold PC algorithm with new quality metrics.
Abstract
Pseudo-Centroid Clustering replaces the traditional concept of a centroid expressed as a center of gravity with the notion of a pseudo-centroid (or a coordinate free centroid) which has the advantage of applying to clustering problems where points do not have numerical coordinates (or categorical coordinates that are translated into numerical form). Such problems, for which classical centroids do not exist, are particularly important in social sciences, marketing, psychology and economics, where distances are not computed from vector coordinates but rather are expressed in terms of characteristics such as affinity relationships, psychological preferences, advertising responses, polling data, market interactions and so forth, where distances, broadly conceived, measure the similarity (or dissimilarity) of characteristics, functions or structures. We formulate a K-PC algorithm analogous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSensory Analysis and Statistical Methods · Advanced Clustering Algorithms Research · Cognitive and psychological constructs research
