Understanding and Optimizing Serverless Workloads in CXL-Enabled Tiered Memory
Yuze Li, Shunyu Yao

TL;DR
This paper investigates the impact of CXL-enabled tiered memory on serverless workloads, proposing a middleware solution called Porter for efficient, application-specific memory provisioning to improve performance and reduce costs.
Contribution
It quantifies CXL effects on serverless applications and introduces Porter, a middleware that optimizes memory placement for hot and cold regions in tiered memory systems.
Findings
CXL impacts vary across serverless applications.
Fine-grained memory provisioning improves performance.
Porter middleware enhances resource utilization and cost efficiency.
Abstract
Recent Serverless workloads tend to be largescaled/CPU-memory intensive, such as DL, graph applications, that require dynamic memory-to-compute resources provisioning. Meanwhile, recent solutions seek to design page management strategies for multi-tiered memory systems, to efficiently run heavy workloads. Compute Express Link (CXL) is an ideal platform for serverless workloads runtime that offers a holistic memory namespace thanks to its cache coherent feature and large memory capacity. However, naively offloading Serverless applications to CXL brings substantial latencies. In this work, we first quantify CXL impacts on various Serverless applications. Second, we argue the opportunity of provisioning DRAM and CXL in a fine-grained, application-specific manner to Serverless workloads, by creating a shim layer to identify, and naively place hot regions to DRAM, while leaving cold/warm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques
