A Scalable Trie Building Algorithm for High-Throughput Phyloanalysis of Wafer-Scale Digital Evolution Experiments
Vivaan Singhvi, Joey Wagner, Emily Dolson, Luis Zaman, Matthew Andres Moreno

TL;DR
This paper introduces a highly scalable trie-building algorithm that significantly accelerates large-scale phylogeny reconstruction in digital evolution experiments, enabling analysis of billions of genomes efficiently.
Contribution
The paper presents a novel trie-building algorithm that achieves 300-fold speedup over previous methods, facilitating large-scale phyloanalysis in high-throughput digital evolution.
Findings
Achieved 300-fold speedup for 10,000-tip trees.
Reconstructed phylogenies from 1 billion genomes in under 3 hours.
Enabled large-scale digital evolution analysis with improved throughput.
Abstract
Agent-based simulation platforms play a key role in enabling fast-to-run evolution experiments that can be precisely controlled and observed in detail. Availability of high-resolution snapshots of lineage ancestries from digital experiments, in particular, is key to investigations of evolvability and open-ended evolution, as well as in providing a validation testbed for bioinformatics method development. Ongoing advances in AI/ML hardware accelerator devices, such as the 850,000-processor Cerebras Wafer-Scale Engine (WSE), are poised to broaden the scope of evolutionary questions that can be investigated in silico. However, constraints in memory capacity and locality characteristic of these systems introduce difficulties in exhaustively tracking phylogenies at runtime. To overcome these challenges, recent work on hereditary stratigraphy algorithms has developed space-efficient genetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Genomics and Phylogenetic Studies · Evolution and Genetic Dynamics
