Mercury: QoS-Aware Tiered Memory System
Jiaheng Lu, Yiwen Zhang, Hasan Al Maruf, Minseo Park, Yunxuan Tang,, Fan Lai, Mosharaf Chowdhury

TL;DR
Mercury is a QoS-aware tiered memory system that manages memory resources proactively to ensure predictable performance for multiple applications with different SLOs, adapting dynamically to changing requirements.
Contribution
It introduces per-tier page reclamation and a proactive admission control algorithm to meet SLOs and mitigate interference in tiered memory systems.
Findings
Improves application performance by up to 53.4% over TPP.
Enhances predictability for coexisting applications with different SLOs.
Effectively mitigates intra- and inter-tier bandwidth interference.
Abstract
Memory tiering has received wide adoption in recent years as an effective solution to address the increasing memory demands of memory-intensive workloads. However, existing tiered memory systems often fail to meet service-level objectives (SLOs) when multiple applications share the system because they lack Quality-of-Service (QoS) support. Consequently, applications suffer severe performance drops due to local memory contention and memory bandwidth interference. In this paper, we present Mercury, a QoS-aware tiered memory system that ensures predictable performance for coexisting memory-intensive applications with different SLOs. Mercury enables per-tier page reclamation for application-level resource management and uses a proactive admission control algorithm to satisfy SLOs via per-tier memory capacity allocation and intra- and inter-tier bandwidth interference mitigation. It reacts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies
