Optimal Data Placement for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments
Xin Du, Songtao Tang, Zhihui Lu, Keke Gai, Jie Wu, and Patrick C.K., Hung

TL;DR
This paper introduces a novel data placement strategy for scientific workflows in heterogeneous edge-cloud environments, optimizing data transfer and cost through an intelligent, constrained model and a hybrid optimization algorithm.
Contribution
It proposes a new data placement model considering multiple constraints and a hybrid optimization algorithm, DE-DPSO-DPS, to improve data sharing efficiency in edge-cloud systems.
Findings
Reduces data transmission time significantly.
Lowers data placement costs compared to traditional methods.
Effectively handles privacy and geographical constraints.
Abstract
The heterogeneous edge-cloud computing paradigm can provide a more optimal direction to deploy scientific workflows than traditional distributed computing or cloud computing environments. Due to the different sizes of scientific datasets and some of these datasets must keep private, it is still a difficult problem to finding an data placement strategy that can minimize data transmission as well as placement cost. To address this issue, this paper combines advantages of both edge and cloud computing to construct a data placement model, which can balance data transfer time and data placement cost using intelligent computation. The most difficult research challenge the model solved is to consider many constrain in this hybrid computing environments, which including shared datasets within individual and among multiple workflows across various geographical regions. According to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Cloud Computing and Resource Management · Distributed and Parallel Computing Systems
