Beyond NGS data sharing and towards open science
Bruno Dantas, Calmenelias Fleitas, Alexandre P. Francisco, Jos\'e, Sim\~ao, C\'atia Vaz

TL;DR
This paper introduces NGSPipes, a framework that enables decentralized, platform-independent execution of bioinformatics pipelines, promoting open science and improving reproducibility in biosciences.
Contribution
The paper presents NGSPipes, a novel framework that simplifies building and executing bioinformatics pipelines without centralized servers, enhancing collaboration and reproducibility.
Findings
NGSPipes enables decentralized pipeline execution.
Framework supports platform independence.
Facilitates long-term pipeline archiving and reuse.
Abstract
Biosciences have been revolutionized by next generation sequencing (NGS) technologies in last years, leading to new perspectives in medical, industrial and environmental applications. And although our motivation comes from biosciences, the following is true for many areas of science: published results are usually hard to reproduce either because data is not available or tools are not readily available, which delays the adoption of new methodologies and hinders innovation. Our focus is on tool readiness and pipelines availability. Even though most tools are freely available, pipelines for data analysis are in general barely described and their configuration is far from trivial, with many parameters to be tuned. In this paper we discuss how to effectively build and use pipelines, relying on state of the art computing technologies to execute them without users need to configure, install…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Advanced Data Storage Technologies · Distributed and Parallel Computing Systems
