Tuning the Tail Latency of Distributed Queries Using Replication
Nathan Ng, Hung Le, Marco Serafini

TL;DR
This paper addresses the challenge of minimizing data replication in distributed graph databases to meet user-defined low-latency query constraints, proposing heuristics and conditions for effective replication schemes.
Contribution
It introduces a novel workload model for data access causality and develops heuristics with correctness conditions to optimize replication for latency constraints.
Findings
Algorithms enable fine-tuning of query latency and replication trade-offs.
Evaluation shows effective latency reduction with minimal replication.
Methods find optimal points in latency/replication design space.
Abstract
Querying graph data with low latency is an important requirement in application domains such as social networks and knowledge graphs. Graph queries perform multiple hops between vertices. When data is partitioned and stored across multiple servers, queries executing at one server often need to hop to vertices stored by another server. Such distributed traversals represent a performance bottleneck for low-latency queries. To reduce query latency, one can replicate remote data to make distributed traversals unnecessary, but replication is expensive and should be minimized. In this paper, we introduce the problem of finding data replication schemes that satisfy arbitrary user-defined query latency constraints with minimal replication cost. We propose a novel workload model to express data access causality, propose a family of heuristics, and introduce non-trivial sufficient conditions for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Cloud Computing and Resource Management · Distributed systems and fault tolerance
