Model-Free Control for Distributed Stream Data Processing using Deep Reinforcement Learning
Teng Li, Zhiyuan Xu, Jian Tang, Yanzhi Wang

TL;DR
This paper introduces a novel deep reinforcement learning framework for distributed stream data processing systems, significantly improving scheduling efficiency by minimizing processing time without relying on explicit system models.
Contribution
It pioneers the application of deep reinforcement learning for model-free control in distributed stream processing, enabling adaptive scheduling based on limited runtime data.
Findings
Reduces average tuple processing time by 33.5% compared to default scheduler
Outperforms state-of-the-art model-based methods in efficiency
Achieves quick convergence during online learning
Abstract
In this paper, we focus on general-purpose Distributed Stream Data Processing Systems (DSDPSs), which deal with processing of unbounded streams of continuous data at scale distributedly in real or near-real time. A fundamental problem in a DSDPS is the scheduling problem with the objective of minimizing average end-to-end tuple processing time. A widely-used solution is to distribute workload evenly over machines in the cluster in a round-robin manner, which is obviously not efficient due to lack of consideration for communication delay. Model-based approaches do not work well either due to the high complexity of the system environment. We aim to develop a novel model-free approach that can learn to well control a DSDPS from its experience rather than accurate and mathematically solvable system models, just as a human learns a skill (such as cooking, driving, swimming, etc).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
