# Using Colors and Sketches to Count Subgraphs in a Streaming Graph

**Authors:** Shirin Handjani, Douglas Jungreis, Mark Tiefenbruck

arXiv: 2302.12210 · 2023-02-24

## TL;DR

This paper improves algorithms for estimating subgraph counts in streaming graphs by reducing storage and update time through three modifications, especially for graphs with bounded degree and specific subgraph structures.

## Contribution

The authors introduce three modifications to an existing algorithm, significantly reducing storage and update time for counting subgraphs in streaming graphs under certain conditions.

## Key findings

- Update time per edge is reduced to O(1).
- Storage is decreased by a factor related to graph parameters.
- Applicable to graphs with no leaves and bounded degree.

## Abstract

Suppose we wish to estimate $\#H$, the number of copies of some small graph $H$ in a large streaming graph $G$. There are many algorithms for this task when $H$ is a triangle, but just a few that apply to arbitrary $H$. Here we focus on one such algorithm, which was introduced by Kane, Mehlhorn, Sauerwald, and Sun. The storage and update time per edge for their algorithm are both $O(m^k/(\#H)^2)$, where $m$ is the number of edges in $G$, and $k$ is the number of edges in $H$. Here, we propose three modifications to their algorithm that can dramatically reduce both the storage and update time. Suppose that $H$ has no leaves and that $G$ has maximum degree $\leq m^{1/2 - \alpha}$, where $\alpha > 0$. Define $C = \min(m^{2\alpha},m^{1/3})$. Then in our version of the algorithm, the update time per edge is $O(1)$, and the storage is approximately reduced by a factor of $C^{2k-t-2}$, where $t$ is the number of vertices in $H$; in particular, the storage is $O(C^2 + m^k/(C^{2k-t-2} (\#H)^2))$.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2302.12210/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/2302.12210/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/2302.12210/full.md

---
Source: https://tomesphere.com/paper/2302.12210