SubGraph2Vec: Highly-Vectorized Tree-likeSubgraph Counting
Langshi Chen, Jiayu Li, Ariful Azad, Cenk Sahinalp, Madhav Marathe,, Anil Vullikanti, Andrey Nikolaev, Egor Smirnov, Ruslan Israfilov, Judy Qiu

TL;DR
SubGraph2Vec introduces a highly efficient, vectorized algorithm for counting tree-like subgraphs in large networks, significantly outperforming previous methods and scalable across CPU and GPU architectures.
Contribution
It presents a novel vectorized algorithm for subgraph counting, with implementations that drastically improve performance and scalability for tree-like subgraphs.
Findings
Achieves up to 660x speedup over state-of-the-art on a single node.
Scales to template size 20 in distributed mode with good scalability.
Portability to both CPU and GPU architectures.
Abstract
Subgraph counting aims to count occurrences of a template T in a given network G(V, E). It is a powerful graph analysis tool and has found real-world applications in diverse domains. Scaling subgraph counting problems is known to be memory bounded and computationally challenging with exponential complexity. Although scalable parallel algorithms are known for several graph problems such as Triangle Counting and PageRank, this is not common for counting complex subgraphs. Here we address this challenge and study connected acyclic graphs or trees. We propose a novel vectorized subgraph counting algorithm, named Subgraph2Vec, as well as both shared memory and distributed implementations: 1) reducing algorithmic complexity by minimizing neighbor traversal; 2) achieving a highly-vectorized implementation upon linear algebra kernels to significantly improve performance and hardware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
