An Empirical Evaluation On the Applicability of the DALiuGE Execution Framework
Ying Mei, Shoulin Wei, Feng Wang, Chen Wu, Rodrigo Tobar, Mohsim, Shaikh, Hui Deng, Wei Dai, Bo Liang, Andreas Wicenec

TL;DR
This paper evaluates the DALiuGE execution framework's performance and usability for processing large-scale astronomical data, highlighting its strengths in software integration and limitations in execution efficiency compared to MPI.
Contribution
It provides an empirical assessment of DALiuGE's performance, developer workload, and implementation complexity in the context of SKA data processing.
Findings
DALiuGE enables rapid integration of astronomical software.
Significant efficiency differences exist across parallel granularities.
DALiuGE lags behind MPI in execution performance even after optimization.
Abstract
The Square Kilometre Array (SKA) project is an international cooperation project to build the largest radio telescope worldwide. Data processing is one of the biggest challenges of building the SKA telescope. As a distributed execution framework, the Data Activated Liu Graph Engine (DALiuGE) was proposed to be one of the candidates for addressing the massive data of the SKA. DALiuGE has many distinctive features, but its actual ability to handle scientific data is still not evident. In this paper, we perform an objective evaluation of the usability of DALiuGE concerning the execution performance, developer workload, and implementation difficulty of porting the SAGECal to DALiuGE. The evaluation results showed that the DALiuGE enables fast integration of astronomical software, but there are significant differences in the efficiency of different parallel granularities. Even with the deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Algorithms and Data Compression · Embedded Systems Design Techniques
