Lambada: Interactive Data Analytics on Cold Data using Serverless Cloud Infrastructure
Ingo M\"uller, Renato Marroqu\'in, Gustavo Alonso

TL;DR
Lambada demonstrates that a carefully designed serverless data analytics system can efficiently and cost-effectively process cold data, outperforming commercial Query-as-a-Service solutions in speed and cost.
Contribution
This work introduces Lambada, a novel serverless architecture for interactive cold data analytics, overcoming known limitations and demonstrating superior performance and cost savings.
Findings
Lambada is one order of magnitude faster than commercial solutions.
Lambada is two orders of magnitude cheaper than commercial solutions.
The system achieves operational simplicity comparable to existing Query-as-a-Service offerings.
Abstract
The promise of ultimate elasticity and operational simplicity of serverless computing has recently lead to an explosion of research in this area. In the context of data analytics, the concept sounds appealing, but due to the limitations of current offerings, there is no consensus yet on whether or not this approach is technically and economically viable. In this paper, we identify interactive data analytics on cold data as a use case where serverless computing excels. We design and implement Lambada, a system following a purely serverless architecture, in order to illustrate when and how serverless computing should be employed for data analytics. We propose several system components that overcome the previously known limitations inherent in the serverless paradigm as well as additional ones we identify in this work. We can show that, thanks to careful design, a serverless query…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
