SLAM: SLO-Aware Memory Optimization for Serverless Applications

Gor Safaryan; Anshul Jindal; Mohak Chadha; Michael Gerndt

arXiv:2207.06183·cs.DC·July 14, 2022

SLAM: SLO-Aware Memory Optimization for Serverless Applications

Gor Safaryan, Anshul Jindal, Mohak Chadha, Michael Gerndt

PDF

Open Access

TL;DR

This paper introduces SLAM, a tool that optimizes memory configurations for serverless applications by modeling function relationships and execution times to meet SLOs while minimizing costs.

Contribution

SLAM is the first tool to optimize memory settings for complex serverless workflows using distributed tracing and execution modeling.

Findings

01

Over 95% of requests meet SLOs with SLAM's configurations.

02

SLAM reduces costs while maintaining performance.

03

Effective for multiple real-world AWS Lambda applications.

Abstract

Serverless computing paradigm has become more ingrained into the industry, as it offers a cheap alternative for application development and deployment. This new paradigm has also created new kinds of problems for the developer, who needs to tune memory configurations for balancing cost and performance. Many researchers have addressed the issue of minimizing cost and meeting Service Level Objective (SLO) requirements for a single FaaS function, but there has been a gap for solving the same problem for an application consisting of many FaaS functions, creating complex application workflows. In this work, we designed a tool called SLAM to address the issue. SLAM uses distributed tracing to detect the relationship among the FaaS functions within a serverless application. By modeling each of them, it estimates the execution time for the application at different memory configurations. Using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · Peer-to-Peer Network Technologies