Optimized Dynamic Cache Instantiation and Accurate LRU Approximations under Time-varying Request Volume
Niklas Carlsson, Derek Eager

TL;DR
This paper explores dynamic cache instantiation in cloud environments, providing models and approximations that optimize cache costs and performance under fluctuating request rates, with significant potential savings.
Contribution
It introduces novel analytic models for time-varying cache workloads, including bounds and approximations for LRU caches, and evaluates cost-performance tradeoffs of dynamic cache deployment.
Findings
Dynamic cache instantiation yields substantial cost savings.
Savings depend heavily on object popularity skew.
Selective cache insertion enhances benefits over traditional edge caches.
Abstract
Content-delivery applications can achieve scalability and reduce wide-area network traffic using geographically distributed caches. However, each deployed cache has an associated cost, and under time-varying request rates (e.g., a daily cycle) there may be long periods when the request rate from the local region is not high enough to justify this cost. Cloud computing offers a solution to problems of this kind, by supporting dynamic allocation and release of resources. In this paper, we analyze the potential benefits from dynamically instantiating caches using resources from cloud service providers. We develop novel analytic caching models that accommodate time-varying request rates, transient behavior as a cache fills following instantiation, and selective cache insertion policies. Within the context of a simple cost model, we then develop bounds and compare policies with optimized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
