Practical Bounds on Optimal Caching with Variable Object Sizes
Daniel S. Berger, Nathan Beckmann, Mor Harchol-Balter

TL;DR
This paper introduces a flow-based method to compute tight bounds on the optimal caching miss ratio with variable object sizes, revealing the potential for significant improvements over current caching policies.
Contribution
The paper presents a novel flow-based approach (FOO) for accurately bounding the optimal caching performance with variable object sizes, and extends it to practical scenarios (PFOO) for large-scale traces.
Findings
FOO achieves less than 0.3% error on 10 million requests.
Current caching systems miss 11-43% more than the optimal.
Prior bounds suggested near-optimal performance, but FOO reveals significant room for improvement.
Abstract
Many recent caching systems aim to improve miss ratios, but there is no good sense among practitioners of how much further miss ratios can be improved. In other words, should the systems community continue working on this problem? Currently, there is no principled answer to this question. In practice, object sizes often vary by several orders of magnitude, where computing the optimal miss ratio (OPT) is known to be NP-hard. The few known results on caching with variable object sizes provide very weak bounds and are impractical to compute on traces of realistic length. We propose a new method to compute upper and lower bounds on OPT. Our key insight is to represent caching as a min-cost flow problem, hence we call our method the flow-based offline optimal (FOO). We prove that, under simple independence assumptions, FOO's bounds become tight as the number of objects goes to infinity.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
