Approximately Counting Subgraphs in Data Streams
Hendrik Fichtenberger, Pan Peng

TL;DR
This paper introduces improved streaming algorithms for approximately counting subgraphs in data streams, utilizing a generic transformation from query models to streaming models, achieving better space complexity for various subgraph counts.
Contribution
The paper presents a novel transformation from sublinear-time query algorithms to streaming algorithms, enabling efficient approximate counting of subgraphs like triangles and cliques in data streams.
Findings
Developed a 3-pass streaming algorithm for subgraph counting with improved space bounds.
Resolved a conjecture on approximating clique counts in graphs with bounded degeneracy.
Generalized the approach to all algorithms in standard sublinear graph query models.
Abstract
Estimating the number of subgraphs in data streams is a fundamental problem that has received great attention in the past decade. In this paper, we give improved streaming algorithms for approximately counting the number of occurrences of an arbitrary subgraph , denoted , when the input graph is represented as a stream of edges. To obtain our algorithms, we provide a generic transformation that converts constant-round sublinear-time graph algorithms in the query access model to constant-pass sublinear-space graph streaming algorithms. Using this transformation, we obtain the following results. 1. We give a -pass turnstile streaming algorithm for -approximating in space, where is the fractional edge-cover of . This improves upon and generalizes a result of McGregor et al. [PODS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Complexity and Algorithms in Graphs · Data Stream Mining Techniques
