Faster motif counting via succinct color coding and adaptive sampling
Marco Bressan, Stefano Leucci, Alessandro Panconesi

TL;DR
This paper introduces optimized color coding and adaptive sampling techniques for faster, more accurate motif counting in large graphs, enabling the detection of rare motifs efficiently.
Contribution
It extends color coding applicability with algorithmic optimizations and a novel adaptive sampling scheme that improves accuracy and scalability for large graph motif counting.
Findings
Built a compact table for 8-node motifs in one hour on large graphs
Reduced running time and space usage significantly compared to previous methods
Counted rare motifs with high accuracy, impossible with uniform sampling
Abstract
We address the problem of computing the distribution of induced connected subgraphs, aka \emph{graphlets} or \emph{motifs}, in large graphs. The current state-of-the-art algorithms estimate the motif counts via uniform sampling, by leveraging the color coding technique by Alon, Yuster and Zwick. In this work we extend the applicability of this approach, by introducing a set of algorithmic optimizations and techniques that reduce the running time and space usage of color coding and improve the accuracy of the counts. To this end, we first show how to optimize color coding to efficiently build a compact table of a representative subsample of all graphlets in the input graph. For -node motifs, we can build such a table in one hour for a graph with M nodes and B edges, which is times larger than the state of the art. We then introduce a novel adaptive sampling scheme that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
