Towards Concurrent Stateful Stream Processing on Multicore Processors (Technical Report)
Shuhao Zhang, Yingjun Wu, Feng Zhang, and Bingsheng He

TL;DR
This paper presents TStream, a new data stream processing system that significantly improves concurrent state access scalability on multicore processors using dual-mode scheduling and dynamic restructuring, outperforming existing solutions.
Contribution
TStream introduces two novel techniques for scalable concurrent state access in stream processing, enabling higher throughput and workload flexibility on multicore architectures.
Findings
Achieves up to 4.8x higher throughput than state-of-the-art systems.
Maintains latency similar to that of those systems while improving scalability.
Handles workload variations like key skewness effectively.
Abstract
Recent data stream processing systems (DSPSs) can achieve excellent performance when processing large volumes of data under tight latency constraints. However, they sacrifice support for concurrent state access that eases the burden of developing stateful stream applications. Recently, some have proposed managing concurrent state access during stream processing by modeling state accesses as transactions. However, these are realized with locks involving serious contention overhead. Their coarse-grained processing paradigm further magnifies contention issues and tends to poorly utilize modern multicore architectures. This paper introduces TStream, a novel DSPS supporting efficient concurrent state access on multicore processors. Transactional semantics is employed as in previous work, but scalability is greatly improved due to two novel designs: 1) dual-mode scheduling, which exposes more…
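To make the lock-based baseline that the abstract criticizes more concrete, here is a minimal Python sketch. It is not taken from the paper and does not show TStream's own design. It only illustrates the general idea of treating the state accesses triggered by one input event as a transaction protected by per-key locks. The names `SharedState`, `transfer`, and `run_demo`, and the keys `"a"` and `"b"`, are hypothetical and chosen purely for illustration.

```python
import threading


class SharedState:
    """Hypothetical shared key-value state guarded by one lock per key."""

    def __init__(self, initial):
        self.values = dict(initial)
        self.locks = {key: threading.Lock() for key in initial}

    def transfer(self, src, dst, amount):
        """Run one state transaction: move `amount` from `src` to `dst`.

        Locks are acquired in sorted key order so that concurrent
        transactions cannot deadlock. Returns False, leaving the state
        unchanged, when `src` holds less than `amount`.
        """
        if src == dst:
            raise ValueError("src and dst must be different keys")
        first, second = sorted((src, dst))
        with self.locks[first], self.locks[second]:
            if self.values[src] < amount:
                return False
            self.values[src] -= amount
            self.values[dst] += amount
            return True


def run_demo(num_threads=4, events_per_thread=1000):
    """Process events on several threads that all touch the same two keys."""
    state = SharedState({"a": 10_000, "b": 10_000})

    def worker():
        # Every event is one transaction over the shared state.
        for _ in range(events_per_thread):
            state.transfer("a", "b", 1)

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return state.values
```

In this sketch every worker touches the same two keys, so all transactions serialize on the same pair of locks regardless of how many threads run. That is one simple way to picture the contention overhead the abstract attributes to lock-based approaches, under the stated assumptions.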
Taxonomy
Topics: Cloud Computing and Resource Management · Distributed systems and fault tolerance · Advanced Database Systems and Queries
