TL;DR
This paper examines how different choices in defining configuration models for fixed degree sequences affect network analysis, emphasizing the importance of graph labeling and demonstrating significant impacts on study conclusions.
Contribution
It clarifies the role of graph labeling in configuration models and provides guidance for selecting appropriate models in various network analyses.
Findings
Graph labeling choices significantly impact multigraph and self-loop analyses.
Different configuration models can lead to substantially different study conclusions.
Only one configuration model is appropriate for each specific network analysis context.
Abstract
Random graph null models have found widespread application in diverse research communities analyzing network datasets, including social, information, and economic networks, as well as food webs, protein-protein interactions, and neuronal networks. The most popular family of random graph null models, called configuration models, are defined as uniform distributions over a space of graphs with a fixed degree sequence. Commonly, properties of an empirical network are compared to properties of an ensemble of graphs from a configuration model in order to quantify whether empirical network properties are meaningful or whether they are instead a common consequence of the particular degree sequence. In this work we study the subtle but important decisions underlying the specification of a configuration model, and investigate the role these choices play in graph sampling procedures and a suite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
