Content Replication in Large Distributed Caches
Sharayu Moharir, Nikhil Karamchandani

TL;DR
This paper addresses optimizing content replication in large distributed caches to minimize central server load, introducing a novel policy inspired by the Knapsack problem that achieves theoretical lower bounds.
Contribution
It establishes a lower bound on transmission rates and proposes the Knapsack Storage policy, which optimally balances content replication in distributed caches.
Findings
Knapsack Storage achieves the theoretical lower bound on transmission rate.
In some cases, less popular content is replicated more than the most popular.
The approach provides a new perspective on content placement strategies.
Abstract
In this paper, we consider the algorithmic task of content replication and request routing in a distributed caching system consisting of a central server and a large number of caches, each with limited storage and service capabilities. We study a time-slotted system where in each time-slot, a large batch of requests has to be matched to a large number of caches, where each request can be served by any cache which stores the requested content. All requests which cannot be served by the caches are served by fetching the requested content from the central server. The goal is to minimize the transmission rate from the central server. We use a novel mapping between our content replication problem and the Knapsack problem to prove a lower bound on the transmission rate for any content replication policy. Using insights obtained from the mapping, we propose a content replication policy -…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
