Implicit models, latent compression, intrinsic biases, and cheap lunches in community detection
Tiago P. Peixoto, Alec Kirkley

TL;DR
This paper introduces a unified framework linking community detection objectives to their implicit generative models, enabling principled comparison of algorithms based on description length, revealing biases, and challenging the 'no free lunch' theorem in structured networks.
Contribution
It develops a method to associate community detection objectives with generative models, allowing objective comparison and analysis of biases across algorithms.
Findings
More expressive methods achieve better compression on structured data.
Descriptive methods tend to overfit due to intrinsic biases.
Structured data challenges the 'no free lunch' theorem in community detection.
Abstract
The task of community detection, which aims to partition a network into clusters of nodes to summarize its large-scale structure, has spawned the development of many competing algorithms with varying objectives. Some community detection methods are inferential, explicitly deriving the clustering objective through a probabilistic generative model, while other methods are descriptive, dividing a network according to an objective motivated by a particular application, making it challenging to compare these methods on the same scale. Here we present a solution to this problem that associates any community detection objective, inferential or descriptive, with its corresponding implicit network generative model. This allows us to compute the description length of a network and its partition under arbitrary objectives, providing a principled measure to compare the performance of different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Data Visualization and Analytics · Advanced Clustering Algorithms Research
