Understanding and Optimizing Serverless Workloads in CXL-Enabled Tiered   Memory

Yuze Li; Shunyu Yao

arXiv:2309.01736·cs.DC·September 26, 2023

Understanding and Optimizing Serverless Workloads in CXL-Enabled Tiered Memory

Yuze Li, Shunyu Yao

PDF

Open Access

TL;DR

This paper investigates the impact of CXL-enabled tiered memory on serverless workloads, proposing a middleware solution called Porter for efficient, application-specific memory provisioning to improve performance and reduce costs.

Contribution

It quantifies CXL effects on serverless applications and introduces Porter, a middleware that optimizes memory placement for hot and cold regions in tiered memory systems.

Findings

01

CXL impacts vary across serverless applications.

02

Fine-grained memory provisioning improves performance.

03

Porter middleware enhances resource utilization and cost efficiency.

Abstract

Recent Serverless workloads tend to be largescaled/CPU-memory intensive, such as DL, graph applications, that require dynamic memory-to-compute resources provisioning. Meanwhile, recent solutions seek to design page management strategies for multi-tiered memory systems, to efficiently run heavy workloads. Compute Express Link (CXL) is an ideal platform for serverless workloads runtime that offers a holistic memory namespace thanks to its cache coherent feature and large memory capacity. However, naively offloading Serverless applications to CXL brings substantial latencies. In this work, we first quantify CXL impacts on various Serverless applications. Second, we argue the opportunity of provisioning DRAM and CXL in a fine-grained, application-specific manner to Serverless workloads, by creating a shim layer to identify, and naively place hot regions to DRAM, while leaving cold/warm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCloud Computing and Resource Management · Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques