A Learned Cache Eviction Framework with Minimal Overhead

Dongsheng Yang; Daniel S. Berger; Kai Li; Wyatt Lloyd

arXiv:2301.11886 · cs.OS · January 30, 2023 · 1 cites

TL;DR

This paper presents MAT (Machine learning At the Tail), a framework that pairs an ML module with a traditional heuristic cache algorithm. It cuts the number of ML predictions per eviction from 63 to 2 while keeping miss ratios comparable to state-of-the-art ML caches across 8 production workloads.

Contribution

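MAT uses the heuristic cache algorithm as a filter: it supplies high-quality training samples and a short list of likely eviction candidates, so the ML model runs on only a few objects per eviction. This makes learned eviction practical for high-throughput systems.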

Findings

1. Reduces ML predictions per eviction from 63 to 2

2. Achieves comparable cache miss ratios to state-of-the-art ML caches

3. Maintains request rates similar to traditional LRU caches

Abstract

Recent work shows the effectiveness of Machine Learning (ML) to reduce cache miss ratios by making better eviction decisions than heuristics. However, state-of-the-art ML caches require many predictions to make an eviction decision, making them impractical for high-throughput caching systems. This paper introduces Machine learning At the Tail (MAT), a framework to build efficient ML-based caching systems by integrating an ML module with a traditional cache system based on a heuristic algorithm. MAT treats the heuristic algorithm as a filter to receive high-quality samples to train an ML model and likely candidate objects for evictions. We evaluate MAT on 8 production workloads, spanning storage, in-memory caching, and CDNs. The simulation experiments show MAT reduces the number of costly ML predictions-per-eviction from 63 to 2, while achieving comparable miss ratios to the…
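
The filtering idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes LRU as the heuristic, and the class name `TailFilteredCache`, the `keep` callback (a stand-in for the ML model), and the `max_predictions` budget are all hypothetical names chosen for illustration. The cache asks the predictor only about objects at the LRU tail, gives a predicted-useful object a second chance, and falls back to plain LRU once the prediction budget is spent.

```python
from collections import OrderedDict
from typing import Callable, Hashable, Optional


class TailFilteredCache:
    """LRU cache that consults a predictor only for objects at the LRU tail.

    Illustrative sketch only; `keep` is a hypothetical stand-in for an ML model.
    """

    def __init__(self, capacity: int, keep: Callable[[Hashable], bool],
                 max_predictions: int = 2) -> None:
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self.keep = keep                        # returns True to retain the object
        self.max_predictions = max_predictions  # assumed per-eviction budget
        self.items: OrderedDict = OrderedDict()  # first entry is the LRU tail
        self.predictions = 0
        self.evictions = 0

    def get(self, key: Hashable) -> Optional[object]:
        if key not in self.items:
            return None
        self.items.move_to_end(key)  # mark as most recently used
        return self.items[key]

    def put(self, key: Hashable, value: object) -> None:
        if key in self.items:
            self.items[key] = value
            self.items.move_to_end(key)
            return
        if len(self.items) >= self.capacity:
            self._evict()
        self.items[key] = value

    def _evict(self) -> None:
        # The heuristic (LRU order) acts as the filter: only tail objects
        # are ever passed to the predictor.
        for _ in range(self.max_predictions):
            candidate = next(iter(self.items))
            self.predictions += 1
            if not self.keep(candidate):
                del self.items[candidate]
                self.evictions += 1
                return
            self.items.move_to_end(candidate)  # second chance
        # Budget exhausted: fall back to evicting the current LRU tail.
        self.items.popitem(last=False)
        self.evictions += 1


if __name__ == "__main__":
    cache = TailFilteredCache(capacity=3, keep=lambda k: k == "hot")
    for k in ["hot", "a", "b", "c"]:
        cache.put(k, k.upper())
    print(sorted(cache.items), cache.predictions, cache.evictions)
```

In this toy run the predictor is called twice for the single eviction: once for `"hot"`, which is retained, and once for `"a"`, which is evicted. A real system would replace `keep` with a trained model and would also need the training-sample path that the abstract mentions, which this sketch omits.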

Taxonomy

Topics: Caching and Content Delivery · Advanced Data Storage Technologies · Data Stream Mining Techniques