To Migrate or not to Migrate: An Analysis of Operator Migration in Distributed Stream Processing
Espen Volnes, Thomas Plagemann, Vera Goebel

TL;DR
This paper reviews operator migration in distributed stream processing, introducing a unified model, analyzing existing solutions, and providing empirical insights into the impact of different migration strategies.
Contribution
It offers a comprehensive conceptual model, classifies existing solutions, and presents an empirical study on migration mechanisms and their effects.
Findings
Migration mechanisms vary significantly across solutions.
Cost-benefit analysis is crucial but often overlooked in migration decisions.
Empirical results highlight the impact of design choices on migration performance.
Abstract
One of the most important issues in data stream processing systems is to use operator migration to handle highly variable workloads in a cost-efficient manner and adapt to the needs at any given time on demand. Operator migration is a complex process that involves changes in the state and stream management of a running query, typically without any loss of data, and with as little disruption to the execution as possible. This survey provides an overview of solutions for operator migration from a historical perspective as well as the perspective of the goal of migration. It introduces a conceptual model of operator migration to establish a unified terminology and classify existing solutions. Existing work in the area is analyzed to separate the mechanism of migration from the decision to migrate the data. In case of the latter, a cost-benefit analysis is emphasized that is important for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Cloud Computing and Resource Management · Distributed systems and fault tolerance
