The ancient Operational Code is embedded in the amino acid substitution matrix and aaRS phylogenies
Julia A. Shore, Barbara R. Holland, Jeremy G. Sumner, Kay Nieselt and, Peter R. Wills

TL;DR
This study reveals that the ancient Operational Code is embedded in the amino acid substitution matrix and phylogenies of aminoacyl-tRNA synthetases, reflecting early evolutionary mechanisms of amino acid recognition and genetic code development.
Contribution
It introduces a parameterized model linking aaRS phylogenies to amino acid recognition properties, elucidating the evolution of the genetic code through structural and functional analysis.
Findings
Parameterization of aaSM fits empirical data better than random models.
A split between polar and nonpolar amino acids improves model accuracy.
Early aaRS evolution reflects recognition of tRNA identity elements.
Abstract
The underlying structure of the canonical amino acid substitution matrix (aaSM) is examined by considering stepwise improvements in the differential recognition of amino acids according to their chemical properties during the branching history of the two aminoacyl-tRNA synthetase (aaRS) superfamilies. The evolutionary expansion of the genetic code is described by a simple parameterization of the aaSM, in which (i) the number of distinguishable amino acid types, (ii) the matrix dimension, and (iii) the number of parameters, each increases by one for each bifurcation in an aaRS phylogeny. Parameterized matrices corresponding to trees in which the size of an amino acid sidechain is the only discernible property behind its categorization as a substrate, exclusively for a Class I or II aaRS, provide a significantly better fit to empirically determined aaSM than trees with random bifurcation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
