Democratizing Scalable Cloud Applications: Transactional Stateful Functions on Streaming Dataflows
Kyriakos Psarakis

TL;DR
This paper introduces a new programming model and streaming dataflow engine that enable scalable, fault-tolerant, and transactional cloud applications, making complex cloud development more accessible and efficient.
Contribution
It presents Stateflow, a high-level programming model, and Styx, a streaming engine supporting serializable transactions and elasticity, advancing cloud application development.
Findings
Styx outperforms existing systems in transaction throughput and latency.
Stateflow simplifies application development with minimal boilerplate.
Styx provides deterministic, fault-tolerant transactions with elastic scaling.
Abstract
Web applications underpin much of modern digital life, yet building scalable and consistent cloud applications remains difficult, requiring expertise across cloud computing, distributed systems, databases, and software engineering. These demands restrict development to a small number of highly specialized engineers. This thesis aims to democratize cloud application development by addressing three challenges: programmability, high-performance fault-tolerant serializable transactions, and serverless semantics. The thesis identifies strong parallels between cloud applications and the streaming dataflow execution model. It first explores this connection through T-Statefun, a transactional extension of Apache Flink Statefun, demonstrating that dataflow systems can support transactional cloud applications via a stateful functions-as-a-service API. However, this approach revealed significant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Cloud Computing and Resource Management · Distributed and Parallel Computing Systems
