Scalable Pattern Matching in Metadata Graphs via Constraint Checking
Tahsin Reza, Hassan Halawa, Matei Ripeanu, Geoffrey Sanders, Roger, Pearce

TL;DR
This paper introduces a scalable pattern matching algorithm for large labeled graphs based on constraint checking, enabling efficient, exact, and comprehensive pattern searches on massive datasets using a vertex-centric framework.
Contribution
The work presents a novel constraint checking approach for exact pattern matching, scalable to massive graphs, and integrates it into a high-performance vertex-centric framework.
Findings
Supports arbitrary patterns with 100% precision
Achieves high scalability on graphs with trillions of edges
Demonstrates effectiveness on real-world large-scale graphs
Abstract
Pattern matching is a fundamental tool for answering complex graph queries. Unfortunately, existing solutions have limited capabilities: they do not scale to process large graphs and/or support only a restricted set of search templates or usage scenarios. We present an algorithmic pipeline that bases pattern matching on constraint checking. The key intuition is that each vertex or edge participating in a match has to meet a set of constrains implicitly specified by the search template. The pipeline we propose, generates these constraints and iterates over them to eliminate all the vertices and edges that do not participate in any match, and reduces the background graph to a subgraph which is the union of all matches. Additional analysis can be performed on this annotated, reduced graph, such as full match enumeration. Furthermore, a vertex-centric formulation for constraint checking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
