Modularity and partially observed graphs
Colin McDiarmid, Fiona Skerman

TL;DR
This paper investigates how partial observations of a large graph affect the estimation of its modularity, showing that under certain conditions, observed modularity closely reflects the true modularity, with implications for sampling methods.
Contribution
The paper provides theoretical bounds on how well observed modularity estimates the true modularity in partially observed graphs under random and vertex sampling models.
Findings
High modularity in the underlying graph implies high observed modularity under random sampling.
Sampling vertices in dense graphs allows accurate modularity estimation, unlike in sparse graphs.
Under-sampling can lead to overestimating the graph's modularity.
Abstract
Suppose that there is an unknown underlying graph on a large vertex set, and we can test only a proportion of the possible edges to check whether they are present in . If has high modularity, is the observed graph likely to have high modularity? We see that this is indeed the case under a mild condition, in a natural model where we test edges at random. We find that with probability at least , as long as the expected number edges in is large enough. Similarly, with probability at least , under the stronger condition that the expected average degree in is large enough. Further, under this stronger condition, finding a good partition for helps us to find a good partition for . We also consider the vertex sampling model for partially observing the underlying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Face and Expression Recognition
