Fast and Accurate Triangle Counting in Graph Streams Using Predictions
Cristian Boldrin, Fabio Vandin

TL;DR
This paper introduces a novel algorithm for triangle counting in graph streams that leverages predictions to improve speed and accuracy, using a combination of sampling techniques and a simple degree-based predictor.
Contribution
It presents the first practical algorithm that integrates predictions into triangle counting in graph streams, achieving faster and more accurate estimates with guaranteed memory bounds.
Findings
Algorithm outperforms state-of-the-art in speed for fixed memory.
Estimates have reduced variance when predictions are informative.
Effective even with imperfect predictors and across multiple graph streams.
Abstract
In this work, we present the first efficient and practical algorithm for estimating the number of triangles in a graph stream using predictions. Our algorithm combines waiting room sampling and reservoir sampling with a predictor for the heaviness of edges, that is, the number of triangles in which an edge is involved. As a result, our algorithm is fast, provides guarantees on the amount of memory used, and exploits the additional information provided by the predictor to produce highly accurate estimates. We also propose a simple and domain-independent predictor, based on the degree of nodes, that can be easily computed with one pass on a stream of edges when the stream is available beforehand. Our analytical results show that, when the predictor provides useful information on the heaviness of edges, it leads to estimates with reduced variance compared to the state-of-the-art, even…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Data Stream Mining Techniques · Advanced Graph Neural Networks
