The divergence time of protein structures modelled by Markov matrices and its relation to the divergence of sequences
Sandun Rajapaksa, Lloyd Allison, Peter J. Stuckey, Maria Garcia de la, Banda, and Arun S. Konagurthu

TL;DR
This paper introduces a Bayesian framework to estimate divergence times of protein structures using Markov matrices, providing more accurate evolutionary insights than sequence-based models, and demonstrates its effectiveness in structure prediction.
Contribution
It develops a novel time-parameterized structural divergence model based on Markov matrices, improving evolutionary analysis and structure prediction accuracy.
Findings
Estimated divergence times from structures outperform sequence-based proxies.
Established a quantitative relationship between structural and sequence divergence.
Achieved competitive secondary structure prediction accuracy.
Abstract
A complete time-parameterized statistical model quantifying the divergent evolution of protein structures in terms of the patterns of conservation of their secondary structures is inferred from a large collection of protein 3D structure alignments. This provides a better alternative to time-parameterized sequence-based models of protein relatedness, that have clear limitations dealing with twilight and midnight zones of sequence relationships. Since protein structures are far more conserved due to the selection pressure directly placed on their function, divergence time estimates can be more accurate when inferred from structures. We use the Bayesian and information-theoretic framework of Minimum Message Length to infer a time-parameterized stochastic matrix (accounting for perturbed structural states of related residues) and associated Dirichlet models (accounting for insertions and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Machine Learning in Bioinformatics · Genomics and Phylogenetic Studies
