Co-Design of Memory-Storage Systems for Workload Awareness with Interpretable Models
Jay Sarkar, Vamsi Pavan Rayaprolu, Abhijeet Bhalerao

TL;DR
This paper presents a machine learning-based co-design framework for memory-storage systems that optimizes error management algorithms in SSDs by analyzing interactions with workloads and memory components, enabling continuous, data-driven improvements.
Contribution
It introduces a novel ML-driven co-design methodology for SSD error management and workload-aware optimization, incorporating interpretable models and workload representation learning.
Findings
Framework evaluates thousands of SSDs across generations.
Enables continuous, data-driven architectural improvements.
Improves error management and workload adaptation.
Abstract
Solid-state storage architectures based on NAND or emerging memory devices (SSD), are fundamentally architected and optimized for both reliability and performance. Achieving these simultaneous goals requires co-design of memory components with firmware-architected Error Management (EM) algorithms for density- and performance-scaled memory technologies. We describe a Machine Learning (ML) for systems methodology and modeling for co-designing the EM subsystem together with the natural variance inherent to scaled silicon process of memory components underlying SSD technology. The modeling analyzes NAND memory components and EM algorithms interacting with comprehensive suite of synthetic (stress-focused and JEDEC) and emulation (YCSB and similar) workloads across Flash Translation abstraction layers, by leveraging a statistically interpretable and intuitively explainable ML algorithm. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Parallel Computing and Optimization Techniques · Cloud Computing and Resource Management
