Effective Cache Apportioning for Performance Isolation Under Compiler Guidance
Bodhisatwa Chatterjee, Sharjeel Khan, Santosh Pande

TL;DR
Com-CAS is a dynamic cache apportioning system that uses compiler analysis and machine learning to adapt cache allocations in real-time, improving performance and SLA adherence in multi-core servers.
Contribution
It introduces a novel compiler-guided, machine learning-based cache partitioning approach that adapts to application phase behavior without hardware modifications.
Findings
15% average throughput improvement over unpartitioned cache
20% performance gain over state-of-the-art KPart
Maintains SLA compliance with minimal worst-case degradation
Abstract
With a growing number of cores in modern high-performance servers, effective sharing of the last level cache (LLC) is more critical than ever. The primary agenda of such systems is to maximize performance by efficiently supporting multi-tenancy of diverse workloads. However, this could be particularly challenging to achieve in practice, because modern workloads exhibit dynamic phase behaviour, which causes their cache requirements & sensitivities to vary at finer granularities during execution. Unfortunately, existing systems are oblivious to the application phase behavior, and are unable to detect and react quickly enough to these rapidly changing cache requirements, often incurring significant performance degradation. In this paper, we propose Com-CAS, a new apportioning system that provides dynamic cache allocations for co-executing applications. Com-CAS differs from the existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Cloud Computing and Resource Management · Advanced Data Storage Technologies
