Homophilic Clustering by Locally Asymmetric Geometry
Deli Zhao, Xiaoou Tang

TL;DR
This paper introduces a novel graph-based clustering algorithm that leverages locally asymmetric geometries and homophilic in-degree structures to effectively identify clusters and noise in high-dimensional, noisy data.
Contribution
It proposes a new algorithm utilizing directed similarity graphs and the HI figure to detect cluster cores and boundaries, improving clustering in noisy, high-dimensional data.
Findings
Effective clustering of noisy high-dimensional data
Ability to identify cluster cores and boundaries
Validated through extensive experiments
Abstract
Clustering is indispensable for data analysis in many scientific disciplines. Detecting clusters from heavy noise remains challenging, particularly for high-dimensional sparse data. Based on graph-theoretic framework, the present paper proposes a novel algorithm to address this issue. The locally asymmetric geometries of neighborhoods between data points result in a directed similarity graph to model the structural connectivity of data points. Performing similarity propagation on this directed graph simply by its adjacency matrix powers leads to an interesting discovery, in the sense that if the in-degrees are ordered by the corresponding sorted out-degrees, they will be self-organized to be homophilic layers according to the different distributions of cluster densities, which is dubbed the Homophilic In-degree figure (the HI figure). With the HI figure, we can easily single out all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Clustering Algorithms Research · Data Visualization and Analytics
