Graph sub-sampling for divide-and-conquer algorithms in large networks
Eric Yanchenko

TL;DR
This paper compares seven graph sub-sampling algorithms in divide-and-conquer methods for large networks, analyzing their effectiveness in community and core-periphery detection through theoretical and experimental evaluation.
Contribution
It provides a comprehensive comparison of sub-sampling algorithms and derives theoretical bounds for misclassification rates in divide-and-conquer algorithms for core-periphery structures.
Findings
Uniform node sampling performs best for community detection.
Core node sampling improves core-periphery detection accuracy.
Divide-and-conquer algorithms often outperform base algorithms in speed and accuracy.
Abstract
As networks continue to increase in size, current methods must be capable of handling large numbers of nodes and edges in order to be practically relevant. Instead of working directly with the entire (large) network, analyzing sub-networks has become a popular approach. Due to a network's inherent inter-connectedness, however, sub-sampling is not a trivial task. While this problem has gained popularity in recent years, it has not received sufficient attention from the statistics community. In this work, we provide a thorough comparison of seven graph sub-sampling algorithms by applying them to divide-and-conquer algorithms for community structure and core-periphery (CP) structure. After discussing the various algorithms and sub-sampling routines, we derive theoretical results for the mis-classification rate of the divide-and-conquer algorithm for CP structure under various sub-sampling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Data Management and Algorithms · Advanced Clustering Algorithms Research
