How to Count Triangles, without Seeing the Whole Graph
Suman K. Bera, C. Seshadhri

TL;DR
This paper introduces TETRIS, a novel sublinear algorithm for estimating triangle counts in large graphs using only random walk sampling, addressing the challenge of limited graph access.
Contribution
The paper presents the first provably sublinear algorithm for triangle counting in the random walk access model, effective on graphs with low mixing time.
Findings
TETRIS accurately estimates triangle counts within 5% relative error.
The algorithm uses a mix of random walks and degree-biased sampling.
It works efficiently on large graphs with limited access.
Abstract
Triangle counting is a fundamental problem in the analysis of large graphs. There is a rich body of work on this problem, in varying streaming and distributed models, yet all these algorithms require reading the whole input graph. In many scenarios, we do not have access to the whole graph, and can only sample a small portion of the graph (typically through crawling). In such a setting, how can we accurately estimate the triangle count of the graph? We formally study triangle counting in the {\em random walk} access model introduced by Dasgupta et al (WWW '14) and Chierichetti et al (WWW '16). We have access to an arbitrary seed vertex of the graph, and can only perform random walks. This model is restrictive in access and captures the challenges of collecting real-world graphs. Even sampling a uniform random vertex is a hard task in this model. Despite these challenges, we design a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Graph Neural Networks · Caching and Content Delivery
