Modularity and Heavy-Tailed Degree Distributions
Larry Wilson

TL;DR
This paper identifies a bias in traditional modularity for clustering graphs with heavy-tailed degree distributions and proposes a simple modification, flat modularity, that improves clustering performance especially for low-degree vertices.
Contribution
The paper introduces flat modularity, a simple variant of modularity, to better handle heavy-tailed degree distributions in graph clustering.
Findings
Flat modularity improves clustering of low-degree vertices.
Modified modularity enhances overall clustering performance.
Traditional modularity performs poorly on graphs with power-law degree distributions.
Abstract
Identifying clusters of vertices in graphs continues to be an important problem, and modularity continues to be used as a tool for solving the problem. Modularity, which measures the quality of a division of the vertices into clusters, explicitly treats vertices of different degrees differently, imposing a larger penalty when high-degree vertices are put in the same cluster. We claim that this unequal treatment negatively impacts the performance of clustering algorithms based on modularity for graphs with heavy-tailed degree distributions. We used the Greedy Modularity hill-climb to find clusters in graphs with power-law degree distributions and observed that it performed poorly clustering low-degree vertices. We propose a simple variant of modularity that we call flat modularity. We found that, using the same algorithm with the modified score instead, we improved the performance of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Clustering Algorithms Research · Bioinformatics and Genomic Networks
