MIPS: Instance Placement for Stream Processing Systems based on Monte Carlo Tree Search
Xi Huang, Ziyu Shao, Yang Yang

TL;DR
This paper introduces MIPS, a novel Monte Carlo Tree Search-based scheme for instance placement in stream processing systems, significantly reducing traffic and improving resource utilization through a two-stage mapping process.
Contribution
It is the first to apply MCTS to the two-stage instance placement problem in stream processing, addressing the challenge with a new, efficient decision-making model.
Findings
MIPS outperforms existing schemes in traffic reduction.
MIPS improves resource utilization.
The approach is effective with a mild number of samples.
Abstract
Stream processing engines enable modern systems to conduct large-scale analytics over unbounded data streams in real time. They often view an application as a direct acyclic graph with streams flowing through pipelined instances of various processing units. One key challenge that emerges is instance placement, i.e., to decide the placement of instances across servers with minimum traffic across servers and maximum resource utilization. The challenge roots in not only its intrinsic complexity but also the impact between successive application deployments. Most updated engines such as Apache Heron exploits a more modularized scheduler design that decomposes the task into two stages: One decides the instance-to-container mapping while the other focuses on the container-to-server mapping that is delegated to standalone resource managers. The unaligned objectives and scheduler designs in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Data Stream Mining Techniques · Advanced Database Systems and Queries
