Comparative Review of Cloud Computing Platforms for Data Science Workflows
Mohammad Rehman, Hairong Wang

TL;DR
This paper presents a comprehensive framework for evaluating and comparing cloud computing platforms tailored for data science workflows, using multi-criteria decision analysis to assist users in selecting suitable platforms.
Contribution
It introduces a novel evaluation framework combining analytical hierarchy process and specific criteria for cloud platform assessment in data science.
Findings
Framework enables consistent evaluation of cloud platforms
It can recommend suitable cloud platforms based on user-defined criteria
The evaluation process is adaptable to different user requirements
Abstract
With the advantages that cloud computing offers in terms of platform as a service, software as a service, and infrastructure as a service, data engineers and data scientists are able to leverage cloud computing for their ETL/ELT (extract, transform and load) and ML (machine learning) requirements and deployments. The proposed framework for the comparative review of cloud computing platforms for data science workflows uses an amalgamation of the analytical hierarchy process, Saaty's fundamental scale of absolute numbers, and a selection of relevant evaluation criteria (namely: automation, error handling, fault tolerance, performance quality, unit testing, data encryption, monitoring, role based access, security, availability, ease of use, integration and interoperability). The framework enables users to evaluate criteria pertaining to cloud platforms for data science workflows, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Cloud Computing and Resource Management
