Accelerating R-based Analytics on the Cloud
Ishan Patel, Andrew Rau-Chaplin, Blesson Varghese

TL;DR
This paper introduces P2RAC, a platform that simplifies running R-based analytics on cloud infrastructure, enabling analysts to easily manage resources and data for large-scale computations.
Contribution
The paper presents P2RAC, a new command-line toolset that facilitates seamless execution of R analytics on cloud platforms like Amazon EC2.
Findings
P2RAC effectively manages cloud resources for R analytics.
Experimental results confirm feasibility for large-scale problems.
Enables analysts to leverage cloud scalability with minimal effort.
Abstract
This paper addresses how the benefits of cloud-based infrastructure can be harnessed for analytical workloads. Often the software handling analytical workloads is not developed by a professional programmer, but on an ad hoc basis by Analysts in high-level programming environments such as R or Matlab. The goal of this research is to allow Analysts to take an analytical job that executes on their personal workstations, and with minimum effort execute it on cloud infrastructure and manage both the resources and the data required by the job. If this can be facilitated gracefully, then the Analyst benefits from on-demand resources, low maintenance cost and scalability of computing resources, all of which are offered by the cloud. In this paper, a Platform for Parallel R-based Analytics on the Cloud (P2RAC) that is placed between an Analyst and a cloud infrastructure is proposed and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
