
TL;DR
This paper introduces T-Stochastic Graphs, a flexible probabilistic model for hierarchical clustering that addresses non-ultrametric structures in social networks, combining spectral methods with phylogenetic algorithms and proving their statistical consistency.
Contribution
It proposes the T-Stochastic Graph model, a general framework for hierarchical clustering without ultrametric constraints, and develops a spectral clustering approach with proven consistency.
Findings
Empirical networks often violate ultrametric assumptions.
The spectral clustering method is statistically consistent.
The T-Stochastic Graph model generalizes existing hierarchical models.
Abstract
Previous statistical approaches to hierarchical clustering for social network analysis all construct an "ultrametric" hierarchy. While the assumption of ultrametricity has been discussed and studied in the phylogenetics literature, it has not yet been acknowledged in the social network literature. We show that "non-ultrametric structure" in the network introduces significant instabilities in the existing top-down recovery algorithms. To address this issue, we introduce an instability diagnostic plot and use it to examine a collection of empirical networks. These networks appear to violate the "ultrametric" assumption. We propose a deceptively simple and yet general class of probabilistic models called -Stochastic Graphs which impose no topological restrictions on the latent hierarchy. To illustrate this model, we propose six alternative forms of hierarchical network models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Clustering Algorithms Research · Topological and Geometric Data Analysis
