A new formulation of protein evolutionary models that account for structural constraints
Andrew J. Bordner, Hans D. Mittelmann

TL;DR
This paper introduces a probabilistic framework based on factor graphs for modeling protein evolution that accounts for structural constraints and site-site correlations, improving data fit over traditional models.
Contribution
The authors develop a novel, computationally tractable model for protein evolution incorporating site interdependence using factor graphs and approximate inference algorithms.
Findings
The new model better fits sequence data than traditional rate matrix models.
Site-site correlations significantly influence amino acid substitution probabilities.
Evolution within homohexameric enzymes shows contributions from contacting subunits.
Abstract
Despite the importance of a thermodynamically stable structure with a conserved fold for protein function, almost all evolutionary models neglect site-site correlations that arise from physical interactions between neighboring amino acid sites. This is mainly due to the difficulty in formulating a computationally tractable model since rate matrices can no longer be used. Here we introduce a general framework, based on factor graphs, for constructing probabilistic models of protein evolution with site interdependence. Conveniently, efficient approximate inference algorithms, like Belief Propagation, can be used to calculate likelihoods for these models. We fit an amino acid substitution model of this type that accounts for both solvent accessibility and site-site correlations. Comparisons of the new model with rate matrix models and a model accounting only for solvent accessibility…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Microbial Metabolic Engineering and Bioproduction · RNA and protein synthesis mechanisms
