On-demand Container Loading in AWS Lambda
Marc Brooker, Mike Danilov, Chris Greenwood, Phil Piwonka

TL;DR
This paper discusses the design and implementation of an on-demand container loading system for AWS Lambda, enabling large container images to be delivered efficiently at massive scale while maintaining low latency and high security.
Contribution
It introduces a scalable storage and caching system optimized for on-demand container image delivery in Lambda, addressing challenges of security, efficiency, and latency at scale.
Findings
Processed hundreds of trillions of invocations
Supported container images up to 10GiB in size
Demonstrated high resilience to load and failures
Abstract
AWS Lambda is a serverless event-driven compute service, part of a category of cloud compute offerings sometimes called Function-as-a-service (FaaS). When we first released AWS Lambda, functions were limited to 250MB of code and dependencies, packaged as a simple compressed archive. In 2020, we released support for deploying container images as large as 10GiB as Lambda functions, allowing customers to bring much larger code bases and sets of dependencies to Lambda. Supporting larger packages, while still meeting Lambda's goals of rapid scale (adding up to 15,000 new containers per second for a single customer, and much more in aggregate), high request rate (millions of requests per second), high scale (millions of unique workloads), and low start-up times (as low as 50ms) presented a significant challenge. We describe the storage and caching system we built, optimized for delivering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Cloud Data Security Solutions · IoT and Edge/Fog Computing
