SasCsvToolkit -- A versatile parallel 'bag-of-tasks' job submission application on heterogeneous and homogeneous platforms for Big Data Analytics such as for Biomedical Informatics
Abhishek Narain Singh

TL;DR
SasCsvToolkit is a versatile tool enabling efficient conversion and processing of SAS data formats into CSV and vice versa, supporting parallel execution on heterogeneous platforms for big data analytics in biomedical informatics.
Contribution
The paper introduces SasCsvToolkit, a novel toolkit that facilitates SAS data conversion and analysis using parallel 'bag-of-tasks' execution on diverse computing platforms.
Findings
Implemented on SLURM scheduler for parallel processing
Supports conversion between SAS and CSV formats
Provides templates for common database operations
Abstract
Background: The need for big data analysis requires being able to process large data which are being held fine-tuned for usage by corporate. It is only very recently that the need for big data has caught attention for low budget corporate groups and academia who typically do not have money and resources to buy expensive licenses of big data analysis platforms such as SAS. The corporate continue to work on SAS data format largely because of systemic organizational history and that the prior codes have been built on them. The data-providers continue to thus provide data in SAS formats. Acute sudden need has arisen because of this gap of data being in SAS format and the coders not having a SAS expertise or training background as the economic and inertial forces acting of having shaped these two class of people have been different. Method: We analyze the differences and thus the need for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSAS software applications and methods
