Analyzing a Two-Tier Disaggregated Memory Protection Scheme Based on Memory Replication
Haris Volos, Yiannakis Sazeides

TL;DR
This paper introduces RAMP, a model for optimizing memory error protection strategies in disaggregated memory systems using two-tier replication, significantly reducing storage overhead while maintaining robustness.
Contribution
The paper presents RAMP, a novel model that improves storage efficiency of memory protection schemes by considering the interaction between error-correcting codes and replication.
Findings
Reduces memory protection storage cost from 27% to 17.7%.
Enhances storage efficiency of state-of-the-art protection mechanisms.
Achieves minimal performance overhead with optimized protection strategies.
Abstract
As memory technologies continue to shrink and memory error rates increase, the demand for stronger reliability becomes increasingly critical. Fine-grain memory replication has emerged as an appealing approach to improving memory fault tolerance by augmenting conventional memory protection based on error-correcting codes with an additional layer of redundancy that replicates data across independent failure domains, such as replicating memory pages across different NUMA sockets. This method can tolerate a broad spectrum of memory errors, from individual memory cell failures to more complex memory controller failures. However, applying memory replication without a holistic consideration of the interaction between error-correcting codes and replication can result in redundant duplication and unnecessary storage overhead. We propose Replication-Aware Memory-error Protection (RAMP), a model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSecurity and Verification in Computing
