Amino acid metabolism conflicts with protein diversity
Teresa Krick, David A. Shub, Nina Verstraete, Diego U. Ferreiro,, Leonardo G. Alonso, Michael Shub, Ignacio E. Sanchez

TL;DR
This paper presents a model explaining amino acid abundance in proteomes as a trade-off between metabolic cost and sequence diversity, aligning well with observed data across many species.
Contribution
It introduces a simple cost-diversity trade-off model that accounts for amino acid abundances and explains proteome diversity and size.
Findings
Model explains amino acid abundance patterns
Proteomes balance size and diversity optimally
Cost function including chemical decay fits data well
Abstract
The twenty protein coding amino acids are found in proteomes with different relative abundances. The most abundant amino acid, leucine, is nearly an order of magnitude more prevalent than the least abundant amino acid, cysteine. Amino acid metabolic costs differ similarly, constraining their incorporation into proteins. On the other hand, sequence diversity is necessary for protein folding, function and evolution. Here we present a simple model for a cost-diversity trade-off postulating that natural proteomes minimize amino acid metabolic flux while maximizing sequence entropy. The model explains the relative abundances of amino acids across a diverse set of proteomes. We found that the data is remarkably well explained when the cost function accounts for amino acid chemical decay. More than one hundred proteomes reach comparable solutions to the trade-off by different combinations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Proteomics Techniques and Applications · Genomics and Phylogenetic Studies · Machine Learning in Bioinformatics
