An Efficient System for Subgraph Discovery
Aparna Joshi, Yu Zhang, Petko Bogdanov, Jeong-Hyon Hwang

TL;DR
Nuri is a system that efficiently finds the top relevant subgraphs in large graphs using prioritized expansion and pruning, outperforming distributed systems in resource usage and speed.
Contribution
The paper introduces Nuri, a novel single-machine system for targeted subgraph discovery that reduces computational costs through prioritization and pruning strategies.
Findings
Nuri outperforms distributed systems by over 100x in speed.
Nuri efficiently handles large datasets on a single core.
Nuri is effective for clique discovery, subgraph isomorphism, and pattern mining.
Abstract
Subgraph discovery in a single data graph---finding subsets of vertices and edges satisfying a user-specified criteria---is an essential and general graph analytics operation with a wide spectrum of applications. Depending on the criteria, subgraphs of interest may correspond to cliques of friends in social networks, interconnected entities in RDF data, or frequent patterns in protein interaction networks to name a few. Existing systems usually examine a large number of subgraphs while employing many computers and often produce an enormous result set of subgraphs. How can we enable fast discovery of only the most relevant subgraphs while minimizing the computational requirements? We present Nuri, a general subgraph discovery system that allows users to succinctly specify subgraphs of interest and criteria for ranking them. Given such specifications, Nuri efficiently finds the k most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Data Mining Algorithms and Applications
