Identifying the Genetic Basis of Functional Protein Evolution Using Reconstructed Ancestors
Victor Hanson-Smith, Christopher Baker, Alexander Johnson

TL;DR
This paper introduces a novel computational method to identify key amino acid mutations responsible for functional changes in ancestral proteins, validated on known evolutionary cases and outperforming existing techniques.
Contribution
The authors developed a new statistical approach that detects function-shifting mutations in ancestral proteins by analyzing sequence deviations, mutual information, and information content changes.
Findings
Accurately identified known function-shifting mutations with low false positives.
Outperformed existing methods based on evolutionary rate variability.
Validated on multiple protein families with systematic biological investigations.
Abstract
A central challenge in the study of protein evolution is the identification of historic amino acid sequence changes responsible for creating novel functions observed in present-day proteins. To address this problem, we developed a new method to identify and rank amino acid mutations in ancestral protein sequences according to their function-shifting potential. Our approach scans the changes between two reconstructed ancestral sequences in order to find (1) sites with sequence changes that significantly deviate from our model-based probabilistic expectations, (2) sites that demonstrate extreme changes in mutual information, and (3) sites with extreme gains or losses of information content. By taking the overlaps of these statistical signals, the method accurately identifies cryptic evolutionary patterns that are often not obvious when examining only the conservation of modern-day protein…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · RNA and protein synthesis mechanisms · Machine Learning in Bioinformatics
