# Conserved cross-domain protein-to-mRNA ratios enable proteome prediction in microbes

**Authors:** Mengshi Zhang, Changyi Zhang, Anayancy Ramos, Rachel J. Whitaker, Marvin Whiteley

PMC · DOI: 10.1128/mbio.01411-25 · 2025-07-24

## TL;DR

This paper shows that protein-to-mRNA ratios are consistent across microbes, allowing better prediction of protein levels from mRNA data without needing proteomic measurements.

## Contribution

The study introduces conserved cross-domain RNA-to-protein conversion factors for accurate proteome prediction in microbes.

## Key findings

- Protein-to-RNA ratios are conserved across diverse bacterial and archaeal species.
- Conversion factors from one species improve protein predictions in distantly related organisms.
- The method works without requiring organism-specific proteomic data.

## Abstract

Microbial communities are often studied by measuring gene expression (mRNA levels), but translating these data into functional insights is challenging because mRNA abundance does not always predict protein levels. Here, we present a strategy to bridge this gap by deriving gene-specific RNA-to-protein conversion factors that improve the prediction of protein abundance from transcriptomic data. Using paired mRNA–protein data sets from seven bacteria and one archaeon, we identified orthologous genes where mRNA levels poorly predicted protein abundance, yet each gene’s protein-to-RNA ratio was consistent across these diverse organisms. Applying the resulting conversion factors to mRNA levels dramatically improved protein abundance predictions, even when the conversion factors were obtained from distantly related species. Remarkably, conversion factors derived from bacteria also enhanced protein prediction in an archaeon, demonstrating the robustness of this approach. This cross-domain framework enables more accurate functional inference in microbiomes without requiring organism-specific proteomic data, offering a powerful new tool for microbial ecology, systems biology, and functional genomics.

Deciphering the biology of natural microbial communities is limited by the lack of functional data. While transcriptomics enables gene expression profiling, mRNA levels often fail to predict protein abundance, the primary indicator of microbial function. Prior studies addressed this by calculating RNA-to-protein (RTP) conversion factors using conserved protein-to-RNA (ptr) ratios across bacterial strains, but their cross-species and cross-domain utility remained unknown. We generated comprehensive transcriptomic and proteomic data sets from seven bacteria and one archaeon spanning diverse metabolisms and ecological niches. We identified orthologous genes with conserved ptr ratios, enabling the discovery of RTP conversion factors that significantly improved protein prediction from mRNA, even between distant species and domains. This reveals previously unrecognized conservation in ptr ratios across domains and eliminates the need for paired proteomic data in many cases. Our approach offers a broadly applicable framework to enhance functional prediction in microbiomes using only transcriptomic data.

## Linked entities

- **Species:** Bacteria (taxon 2), archaeon (taxon 1906665)

## Full-text entities

- **Diseases:** CF (MESH:D003550), ptr (MESH:D012327), infected (MESH:D007239)
- **Chemicals:** salt (MESH:D012492), amino acids (MESH:D000596), ethanol (MESH:D000431), ice (MESH:D007053), menadione (MESH:D024483), hemin (MESH:D006427), PBS (MESH:D007854), CO2 (MESH:D002245), chloroacetamide (MESH:C013874), N2 (MESH:D009584), cysteine (MESH:D003545), methionine (MESH:D008715), tris(2-carboxyethyl)phosphine (MESH:C080938), EasyPep (-)
- **Species:** Aggregatibacter actinomycetemcomitans Y4 (strain) [taxon 1035194], Staphylococcus aureus (species) [taxon 1280], Streptococcus gordonii (species) [taxon 1302], Homo sapiens (human, species) [taxon 9606], Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Bacillota (clostridial firmicutes, phylum) [taxon 1239], Porphyromonas gingivalis (species) [taxon 837], Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395], Saccharolobus islandicus M.16.4 (strain) [taxon 426118], Pseudomonadota (proteobacteria, phylum) [taxon 1224], Aggregatibacter actinomycetemcomitans (species) [taxon 714], Pseudomonas aeruginosa (species) [taxon 287], Saccharolobus islandicus (species) [taxon 43080]
- **Mutations:** C-78 C
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Figures

18 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12345168/full.md

---
Source: https://tomesphere.com/paper/PMC12345168