CacheMind: From Miss Rates to Why -- Natural-Language, Trace-Grounded Reasoning for Cache Replacement
Kaushal Mhapsekar, Azam Ghanbari, Bita Aslrousta, Samira Mirbagher-Ajorpaz

TL;DR
CacheMind introduces a natural-language, trace-grounded reasoning tool using LLMs and RAG to improve cache replacement analysis, enabling architects to ask semantic questions and receive human-readable, program-aware answers.
Contribution
This work presents CacheMind, the first conversational tool leveraging LLMs and RAG for semantic reasoning over cache traces, with a new benchmark suite for evaluation.
Findings
CacheMind achieves up to 89.33% accuracy on reasoning tasks.
SIEVE retriever improves trace-grounded question retrieval success to 60%.
Concrete insights demonstrate CacheMind's utility in cache optimization.
Abstract
Cache replacement remains a challenging problem in CPU microarchitecture, often addressed using hand-crafted heuristics, limiting cache performance. Cache data analysis requires parsing millions of trace entries with manual filtering, making the process slow and non-interactive. To address this, we introduce CacheMind, a conversational tool that uses Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs) to enable semantic reasoning over cache traces. Architects can now ask natural language questions like, "Why is the memory access associated with PC X causing more evictions?", and receive trace-grounded, human-readable answers linked to program semantics for the first time. To evaluate CacheMind, we present CacheMindBench, the first verified benchmark suite for LLM-based reasoning for the cache replacement problem. Using the SIEVE retriever, CacheMind achieves 66.67% on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Software System Performance and Reliability · Software Engineering Research
