Theories on PHYlogenetic ReconstructioN (PHYRN)
Gaurav Bhardwaj, Zhenhai Zhang, Yoojin Hong, Kyung Dae Ko, Gue Su, Chang, Evan J. Smith, Lindsay A. Kline, D. Nicholas Hartranft, Edward C., Holmes, Randen L. Patterson, and Damian B. van Rossum

TL;DR
PHYRN is a novel MSA-independent method that uses PSSMs derived from query sequences to improve deep node resolution in phylogenetic trees, especially for rapidly evolving proteins, outperforming traditional MSA-based methods.
Contribution
This paper introduces PHYRN, a new phylogenetic inference method that leverages phylogenetic profiles and PSSMs, offering significant improvements over MSA-dependent approaches for divergent protein families.
Findings
PHYRN achieves 4- to 100-fold more accurate deep node measurements.
PHYRN is scalable to thousands of sequences.
PHYRN effectively resolves evolutionary relationships in rapidly evolving proteins.
Abstract
The inability to resolve deep node relationships of highly divergent/rapidly evolving protein families is a major factor that stymies evolutionary studies. In this manuscript, we propose a Multiple Sequence Alignment (MSA) independent method to infer evolutionary relationships. We previously demonstrated that phylogenetic profiles built using position specific scoring matrices (PSSMs) are capable of constructing informative evolutionary histories(1;2). In this manuscript, we theorize that PSSMs derived specifically from the query sequences used to construct the phylogenetic tree will improve this method for the study of rapidly evolving proteins. To test this theory, we performed phylogenetic analyses of a benchmark protein superfamily (reverse transcriptases (RT)) as well as simulated datasets. When we compare the results obtained from our method, PHYlogenetic ReconstructioN (PHYRN),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Machine Learning in Bioinformatics · RNA and protein synthesis mechanisms
