Estimating the Number of Connected Components in a Graph via Subgraph Sampling
Jason M. Klusowski, Yihong Wu

TL;DR
This paper investigates the feasibility of estimating the number of connected components in large graphs through subgraph sampling, establishing bounds and efficient estimators specifically for chordal graphs.
Contribution
It characterizes the optimal sample complexity for chordal graphs and introduces linear-time estimators that achieve these bounds, expanding prior work beyond special graph classes.
Findings
Impossible to estimate in sublinear regime for graphs with high-degree vertices or long cycles.
Achieves optimal sample complexity bounds for chordal graphs.
Proposes linear-time estimators that are provably optimal.
Abstract
Learning properties of large graphs from samples has been an important problem in statistical network analysis since the early work of Goodman \cite{Goodman1949} and Frank \cite{Frank1978}. We revisit a problem formulated by Frank \cite{Frank1978} of estimating the number of connected components in a large graph based on the subgraph sampling model, in which we randomly sample a subset of the vertices and observe the induced subgraph. The key question is whether accurate estimation is achievable in the \emph{sublinear} regime where only a vanishing fraction of the vertices are sampled. We show that it is impossible if the parent graph is allowed to contain high-degree vertices or long induced cycles. For the class of chordal graphs, where induced cycles of length four or above are forbidden, we characterize the optimal sample complexity within constant factors and construct linear-time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Graph Theory Research · Complexity and Algorithms in Graphs
