BioWorkbench: A High-Performance Framework for Managing and Analyzing Bioinformatics Experiments
Maria Luiza Mondelli, Thiago Magalh\~aes, Guilherme Loss, Michael, Wilde, Ian Foster, Marta Mattoso, Daniel S. Katz, Helio J. C. Barbosa, Ana, Tereza R. Vasconcelos, Kary Oca\~na, Luiz M. R. Gadelha Jr

TL;DR
BioWorkbench is a high-performance framework that manages, analyzes, and visualizes provenance data from large-scale bioinformatics experiments, significantly reducing execution time and enabling enriched scientific analysis.
Contribution
The paper introduces BioWorkbench, a novel framework integrating provenance collection, analysis, and visualization for bioinformatics workflows, with demonstrated scalability and performance improvements.
Findings
Reduces execution time by up to 98% in case studies
Supports scalable management of bioinformatics experiments
Enables enriched analysis with machine learning techniques
Abstract
Advances in sequencing techniques have led to exponential growth in biological data, demanding the development of large-scale bioinformatics experiments. Because these experiments are computation- and data-intensive, they require high-performance computing (HPC) techniques and can benefit from specialized technologies such as Scientific Workflow Management Systems (SWfMS) and databases. In this work, we present BioWorkbench, a framework for managing and analyzing bioinformatics experiments. This framework automatically collects provenance data, including both performance data from workflow execution and data from the scientific domain of the workflow application. Provenance data can be analyzed through a web application that abstracts a set of queries to the provenance database, simplifying access to provenance information. We evaluate BioWorkbench using three case studies: SwiftPhylo,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
