# Improving prediction of bacterial sRNA regulatory targets with expression data

**Authors:** Yildiz Derinkok, Haiqi Wang, Brian Tjaden

PMC · DOI: 10.1093/nargab/lqaf055 · NAR Genomics and Bioinformatics · 2025-05-08

## TL;DR

This paper shows how using gene expression data can improve predictions of which genes are regulated by small RNA molecules in bacteria.

## Contribution

The novel contribution is integrating RNA-seq expression data with machine learning to enhance sRNA target prediction accuracy.

## Key findings

- Integrating co-expressed gene data improves computational prediction of sRNA targets.
- Smaller, high-quality target sets yield better machine learning model performance.
- RNA-seq compendia from E. coli and Salmonella were used to identify co-expressed gene groups.

## Abstract

Small regulatory RNAs (sRNAs) are widespread in bacteria. However, characterizing the targets of sRNA regulation in a way that scales with the increasing number of identified sRNAs has proven challenging. Computational methods offer one means for efficient characterization of sRNA targets, but the sensitivity and precision of such computational methods is limited. Here, we investigate whether publicly available expression data from RNA-seq experiments can improve the accuracy of computational prediction of sRNA regulatory targets. Using compendia of 2143 Escherichia coli RNA-seq samples and 177 Salmonella RNA-seq samples, we identify groups of co-expressed genes in each organism and incorporate this expression information into computational prediction of sRNA targets based on machine learning methods. We find that integrating expression information significantly improves the accuracy of computational results. Further, we observe that computational methods perform better when trained on smaller, higher quality sets of targets rather than on larger, noisier sets of targets identified by high-throughput methods.

## Linked entities

- **Species:** Escherichia coli (taxon 562), Salmonella (taxon 590)

## Full-text entities

- **Species:** Escherichia coli (E. coli, species) [taxon 562], Salmonella (genus) [taxon 590]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12060007/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12060007/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC12060007/full.md

---
Source: https://tomesphere.com/paper/PMC12060007