The tree reconstruction game: phylogenetic reconstruction using reinforcement learning
Dana Azouri, Oz Granit, Michael Alburquerque, Yishay Mansour, Tal, Pupko, Itay Mayrose

TL;DR
This paper introduces a reinforcement learning approach to phylogenetic tree reconstruction, demonstrating that an RL agent can effectively learn search strategies that produce likelihood scores comparable to traditional methods.
Contribution
It presents a novel reinforcement learning framework for phylogeny reconstruction that does not rely on likelihood calculations at each step, offering a new paradigm for the field.
Findings
Likelihood scores similar to existing software
RL-based search strategies outperform greedy methods
Effective on unseen empirical data with up to 20 sequences
Abstract
We propose a reinforcement-learning algorithm to tackle the challenge of reconstructing phylogenetic trees. The search for the tree that best describes the data is algorithmically challenging, thus all current algorithms for phylogeny reconstruction use various heuristics to make it feasible. In this study, we demonstrate that reinforcement learning can be used to learn an optimal search strategy, thus providing a novel paradigm for predicting the maximum-likelihood tree. Our proposed method does not require likelihood calculation with every step, nor is it limited to greedy uphill moves in the likelihood space. We demonstrate the use of the developed deep-Q-learning agent on a set of unseen empirical data, namely, on unseen environments defined by nucleotide alignments of up to 20 sequences. Our results show that the likelihood scores of the inferred phylogenies are similar to those…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Algorithms and Data Compression · Genome Rearrangement Algorithms
