Distributed Estimation of Graph 4-Profiles
Ethan R. Elenberg, Karthikeyan Shanmugam, Michael Borokhovich,, Alexandros G. Dimakis

TL;DR
This paper introduces a distributed, message-passing algorithm for efficiently estimating global and local 4-profiles in large graphs, enabling rapid analysis of connectivity patterns with theoretical guarantees.
Contribution
The paper presents a novel distributed algorithm for counting 4-node subgraphs and estimating local 4-profiles using compressed two-hop information and graph sparsification techniques.
Findings
Algorithm computes global and local 4-profiles in minutes on graphs with millions of edges.
Scales up to 640 cores with significant speedup over previous methods.
Provides theoretical guarantees on approximation quality through concentration results.
Abstract
We present a novel distributed algorithm for counting all four-node induced subgraphs in a big graph. These counts, called the -profile, describe a graph's connectivity properties and have found several uses ranging from bioinformatics to spam detection. We also study the more complicated problem of estimating the local -profiles centered at each vertex of the graph. The local -profile embeds every vertex in an -dimensional space that characterizes the local geometry of its neighborhood: vertices that connect different clusters will have different local -profiles compared to those that are only part of one dense cluster. Our algorithm is a local, distributed message-passing scheme on the graph and computes all the local -profiles in parallel. We rely on two novel theoretical contributions: we show that local -profiles can be calculated using compressed two-hop…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
