A Bring-Your-Own-Model Approach for ML-Driven Storage Placement in Warehouse-Scale Computers
Chenxi Yang, Yan Li, Martin Maas, Mustafa Uysal, Ubaid Ullah Hafeez,, Arif Merchant, Richard McDougall

TL;DR
This paper proposes a novel cross-layer ML approach where workloads train their own models to improve storage placement in warehouse-scale computers, leading to significant TCO savings and practical deployment advantages.
Contribution
It introduces a workload-driven model training strategy that enhances storage placement efficiency and practicality in real-world hyperscale data centers.
Findings
Up to 3.47× TCO savings in deployment and simulation.
Workload-specific models improve adaptability and efficiency.
Practical implementation demonstrated at Google.
Abstract
Storage systems account for a major portion of the total cost of ownership (TCO) of warehouse-scale computers, and thus have a major impact on the overall system's efficiency. Machine learning (ML)-based methods for solving key problems in storage system efficiency, such as data placement, have shown significant promise. However, there are few known practical deployments of such methods. Studying this problem in the context of real-world hyperscale data centers at Google, we identify a number of challenges that we believe cause this lack of practical adoption. Specifically, prior work assumes a monolithic model that resides entirely within the storage layer, an unrealistic assumption in real-world deployments with frequently changing workloads. To address this problem, we introduce a cross-layer approach where workloads instead ''bring their own model''. This strategy moves ML out of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Advanced Data Storage Technologies · Caching and Content Delivery
