Performance Guarantees for Distributed Reachability Queries
Wenfei Fan, Xin Wang, Yinghui Wu

TL;DR
This paper introduces distributed algorithms for reachability queries on fragmented graphs, providing performance guarantees and demonstrating scalability in real-world and synthetic datasets.
Contribution
It presents novel distributed evaluation algorithms for reachability, bounded reachability, and regular reachability, with proven performance guarantees and MapReduce implementation.
Findings
Algorithms visit each site only once
Network traffic depends on query size and fragmentation
Algorithms are scalable on large, distributed graphs
Abstract
In the real world a graph is often fragmented and distributed across different sites. This highlights the need for evaluating queries on distributed graphs. This paper proposes distributed evaluation algorithms for three classes of queries: reachability for determining whether one node can reach another, bounded reachability for deciding whether there exists a path of a bounded length between a pair of nodes, and regular reachability for checking whether there exists a path connecting two nodes such that the node labels on the path form a string in a given regular expression. We develop these algorithms based on partial evaluation, to explore parallel computation. When evaluating a query Q on a distributed graph G, we show that these algorithms possess the following performance guarantees, no matter how G is fragmented and distributed: (1) each site is visited only once; (2) the total…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Cloud Computing and Resource Management · Caching and Content Delivery
