Library of efficient algorithms for phylogenetic analysis
Luana Silva

TL;DR
This paper introduces a comprehensive, efficient, and extensible library for phylogenetic analysis that integrates all workflow steps, offers high performance, and supports workflow interruption and resumption.
Contribution
The library uniquely combines all four phylogenetic workflow steps with a common API, enabling extensibility, reusability, and improved usability over existing tools.
Findings
Algorithms conform to theoretical time complexity.
NJ algorithms have linear memory complexity.
MST and GCP algorithms have quadratic memory complexity.
Abstract
Evolutionary relationships between species are usually inferred through phylogenetic analysis, which provides phylogenetic trees computed from allelic profiles built by sequencing specific regions of the sequences and abstracting them to categorical indexes. With growing exchanges of people and merchandise, epidemics have become increasingly important, and combining information of country-specific datasets can now reveal unknown spreading patterns. The phylogenetic analysis workflow is mainly composed of four consecutive steps, the distance calculation, distance correction, inference algorithm, and local optimization steps. There are many phylogenetic tools out there, however most implement only some of these steps and serve only one single purpose, such as one type of algorithm. Another problem with these is that they are often hard to use and integrate, since each of them has its own…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genetic diversity and population structure · Chromosomal and Genetic Variations
