Analysis of Phylogeny Tracking Algorithms for Serial and Multiprocess Applications
Matthew Andres Moreno, Santiago Rodriguez Papa, Emily Dolson

TL;DR
This paper explores algorithms for phylogenetic tracking in both serial and distributed computing, introducing new methods like reference-counting pruning and trie-based reconstruction to improve lineage analysis in biological and digital contexts.
Contribution
It presents novel procedures for phylogenetic analysis tailored to serial and distributed systems, enhancing accuracy and efficiency in lineage reconstruction.
Findings
Reference-counting-based pruning reduces memory usage.
Trie-based approach enables efficient distributed phylogenetic inference.
Methods applicable to digital artifacts like viruses and media.
Abstract
Since the advent of modern bioinformatics, the challenging, multifaceted problem of reconstructing phylogenetic history from biological sequences has hatched perennial statistical and algorithmic innovation. Studies of the phylogenetic dynamics of digital, agent-based evolutionary models motivate a peculiar converse question: how to best engineer tracking to facilitate fast, accurate, and memory-efficient lineage reconstructions? Here, we formally describe procedures for phylogenetic analysis in both serial and distributed computing scenarios. With respect to the former, we demonstrate reference-counting-based pruning of extinct lineages. For the latter, we introduce a trie-based phylogenetic reconstruction approach for "hereditary stratigraphy" genome annotations. This process allows phylogenetic relationships between genomes to be inferred by comparing their similarities, akin to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Data Storage Technologies · Cellular Automata and Applications
