Model-Free Control for Distributed Stream Data Processing using Deep   Reinforcement Learning

Teng Li; Zhiyuan Xu; Jian Tang; Yanzhi Wang

arXiv:1803.01016·cs.DC·March 6, 2018

Model-Free Control for Distributed Stream Data Processing using Deep Reinforcement Learning

Teng Li, Zhiyuan Xu, Jian Tang, Yanzhi Wang

PDF

TL;DR

This paper introduces a novel deep reinforcement learning framework for distributed stream data processing systems, significantly improving scheduling efficiency by minimizing processing time without relying on explicit system models.

Contribution

It pioneers the application of deep reinforcement learning for model-free control in distributed stream processing, enabling adaptive scheduling based on limited runtime data.

Findings

01

Reduces average tuple processing time by 33.5% compared to default scheduler

02

Outperforms state-of-the-art model-based methods in efficiency

03

Achieves quick convergence during online learning

Abstract

In this paper, we focus on general-purpose Distributed Stream Data Processing Systems (DSDPSs), which deal with processing of unbounded streams of continuous data at scale distributedly in real or near-real time. A fundamental problem in a DSDPS is the scheduling problem with the objective of minimizing average end-to-end tuple processing time. A widely-used solution is to distribute workload evenly over machines in the cluster in a round-robin manner, which is obviously not efficient due to lack of consideration for communication delay. Model-based approaches do not work well either due to the high complexity of the system environment. We aim to develop a novel model-free approach that can learn to well control a DSDPS from its experience rather than accurate and mathematically solvable system models, just as a human learns a skill (such as cooking, driving, swimming, etc).…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.