Scientific Workflows in Heterogeneous Edge-Cloud Computing: A Data Placement Strategy Based on Reinforcement learning
Xin Du

TL;DR
This paper introduces a reinforcement learning-based data placement strategy for scientific workflows in heterogeneous edge-cloud environments, effectively reducing data transmission time by dynamically optimizing dataset distribution.
Contribution
It presents a novel two-stage data placement model combining particle swarm optimization and reinforcement learning for dynamic dataset distribution in edge-cloud workflows.
Findings
Reduces data transmission time compared to existing strategies
Effective dynamic dataset distribution across geographical regions
Improves workflow execution efficiency in heterogeneous environments
Abstract
The heterogeneous edge-cloud computing paradigm can provide an optimal solution to deploy scientific workflows compared to cloud computing or other traditional distributed computing environments. Owing to the different sizes of scientific datasets and the privacy issue concerning some of these datasets, it is essential to find a data placement strategy that can minimize data transmission time. Some state-of-the-art data placement strategies combine edge computing and cloud computing to distribute scientific datasets. However, the dynamic distribution of newly generated datasets to appropriate datacenters and exiting the spent datasets are still a challenge during workflows execution. To address this challenge, this study not only constructs a data placement model that includes shared datasets within individual and among multiple workflows across various geographical regions, but also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Cloud Computing and Resource Management · Privacy-Preserving Technologies in Data
