An Imitation Learning Approach for Cache Replacement
Evan Zheran Liu, Milad Hashemi, Kevin Swersky, Parthasarathy, Ranganathan, Junwhan Ahn

TL;DR
This paper introduces Parrot, an imitation learning approach that approximates an optimal cache eviction policy, significantly improving cache hit rates across diverse applications by learning from future access patterns.
Contribution
It presents a novel imitation learning method for cache replacement that leverages Belady's oracle to improve cache performance without future access knowledge.
Findings
Parrot increases cache miss rates by 20% over state-of-the-art methods on SPEC applications.
Parrot improves cache hit rates by 61% over LRU on a web search benchmark.
The authors release a Gym environment to support further research in cache replacement policies.
Abstract
Program execution speed critically depends on increasing cache hits, as cache hits are orders of magnitude faster than misses. To increase cache hits, we focus on the problem of cache replacement: choosing which cache line to evict upon inserting a new line. This is challenging because it requires planning far ahead and currently there is no known practical solution. As a result, current replacement policies typically resort to heuristics designed for specific common access patterns, which fail on more diverse and complex access patterns. In contrast, we propose an imitation learning approach to automatically learn cache access patterns by leveraging Belady's, an oracle policy that computes the optimal eviction decision given the future cache accesses. While directly applying Belady's is infeasible since the future is unknown, we train a policy conditioned only on past accesses that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsData Stream Mining Techniques · Cloud Computing and Resource Management · Advanced Neural Network Applications
MethodsAttention Is All You Need · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Linear Layer · Tanh Activation · Sigmoid Activation · Softmax · Long Short-Term Memory · Parrot
