# OpDetect: A convolutional and recurrent neural network classifier for precise and sensitive operon detection from RNA-seq data

**Authors:** Rezvan Karaji, Lourdes Peña-Castillo, Ivan S Petrushin, Ivan S Petrushin, Ivan S Petrushin

PMC · DOI: 10.1371/journal.pone.0329355 · PLOS One · 2025-08-01

## TL;DR

OpDetect is a new tool that uses RNA-seq data to accurately detect operons in prokaryotic genomes and even in some eukaryotes.

## Contribution

OpDetect introduces a species-agnostic deep learning approach for operon detection using RNA-seq data.

## Key findings

- OpDetect outperforms existing methods in recall, F1-score, and AUROC metrics.
- The method works across a wide range of bacterial species and even in C. elegans.
- OpDetect uses a convolutional and recurrent neural network architecture for operon detection.

## Abstract

An operon refers to a group of neighbouring genes belonging to one or more overlapping transcription units that are transcribed in the same direction and have at least one gene in common. Operons are a characteristic of prokaryotic genomes. Identifying which genes belong to the same operon facilitates understanding of gene function and regulation. There are several computational approaches for operon detection; however, many of these computational approaches have been developed for a specific target bacterium or require information only available for a restricted number of bacterial species. Here, we introduce a general method, OpDetect, that directly utilizes RNA-sequencing (RNA-seq) reads as a signal over nucleotide bases in the genome. This representation enabled us to employ a convolutional and recurrent deep neural network architecture which demonstrated superior performance in terms of recall, F1-score and Area under the Receiver-Operating characteristic Curve (AUROC) compared to previous approaches. Additionally, OpDetect showcases species-agnostic capabilities, successfully detecting operons in a wide range of bacterial species and even in Caenorhabditis elegans, one of few eukaryotic organisms known to have operons. OpDetect is available at https://github.com/BioinformaticsLabAtMUN/OpDetect.

## Linked entities

- **Species:** Caenorhabditis elegans (taxon 6239)

## Full-text entities

- **Chemicals:** Ivan S (-)
- **Species:** Borreliella burgdorferi (Lyme disease spirochete, species) [taxon 139], Caenorhabditis elegans (species) [taxon 6239], Staphylococcus aureus (species) [taxon 1280], Bacillus subtilis subsp. subtilis (subspecies) [taxon 135461], Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395], Escherichia coli (E. coli, species) [taxon 562], C. elegans [taxon 328850], Bacillus subtilis (species) [taxon 1423], Helicobacter pylori (species) [taxon 210], Photobacterium profundum (species) [taxon 74109], aureus [taxon 46170], Bradyrhizobium diazoefficiens USDA 110 (strain) [taxon 224911], Photobacterium profundum SS9 (strain) [taxon 298386], Homo sapiens (human, species) [taxon 9606], Yersinia pestis CO92 (strain) [taxon 214092]
- **Cell lines:** MG1655 — Homo sapiens (Human), Maple syrup urine disease, Transformed cell line (CVCL_D514)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12316264/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12316264/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/PMC12316264/full.md

---
Source: https://tomesphere.com/paper/PMC12316264