Challenges and characterization of a Biological system on Grid by means of the PhyloGrid application
Raul Isea, Esther Montes, Antonio J. Rubio-Montero, Rafael Mayo

TL;DR
This paper introduces PhyloGrid, a new application designed for large-scale phylogenetic analysis of extensive biological datasets, demonstrated through HIV-1 origin studies using thousands of sequences.
Contribution
It presents the development of PhyloGrid, a novel tool enabling large-scale phylogenetic computations on Grid infrastructure for biological research.
Findings
Successfully analyzed 2900 HIV-1 sequences.
Demonstrated feasibility of large-scale phylogenetic workflows.
Enabled insights into HIV-1 origin through extensive data analysis.
Abstract
In this work we present a new application that is being developed. PhyloGrid is able to perform large-scale phylogenetic calculations as those that have been made for estimating the phylogeny of all the sequences already stored in the public NCBI database. The further analysis has been focused on checking the origin of the HIV-1 disease by means of a huge number of sequences that sum up to 2900 taxa. Such a study has been able to be done by the implementation of a workflow in Taverna.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Glycosylation and Glycoproteins Research
