2.5K-Graphs: from Sampling to Generation
Minas Gjoka, Maciej Kurant, Athina Markopoulou

TL;DR
This paper introduces a practical methodology for generating realistic graphs that match specific network metrics, using sampling-based estimators and efficient algorithms, outperforming existing methods in speed and accuracy.
Contribution
The authors develop a complete approach combining sampling estimators, property adjustment, and topology generation to produce graphs matching joint degree distribution and clustering coefficients.
Findings
Generated graphs closely match original in multiple metrics.
Method is significantly faster than existing techniques.
Graphs preserve key structural properties of real networks.
Abstract
Understanding network structure and having access to realistic graphs plays a central role in computer and social networks research. In this paper, we propose a complete, and practical methodology for generating graphs that resemble a real graph of interest. The metrics of the original topology we target to match are the joint degree distribution (JDD) and the degree-dependent average clustering coefficient (). We start by developing efficient estimators for these two metrics based on a node sample collected via either independence sampling or random walks. Then, we process the output of the estimators to ensure that the target properties are realizable. Finally, we propose an efficient algorithm for generating topologies that have the exact target JDD and a close to the target. Extensive simulations using real-life graphs show that the graphs generated by our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Topological and Geometric Data Analysis · Advanced Clustering Algorithms Research
