# Investigating the nature of prokaryotic genomic island locations within a genome

**Authors:** Reem Aldaihani, Lenwood S. Heath

PMC · DOI: 10.1371/journal.pone.0301172 · 2024-05-02

## TL;DR

This paper explores where genomic islands tend to be located in prokaryotic genomes, finding preferred sites near replication origins and termini.

## Contribution

The study identifies preferred genomic locations for genomic islands in prokaryotes, distinguishing between linear and circular genomes.

## Key findings

- Genomic islands in linear genomes are often located near the origin of replication and terminus regions.
- In circular genomes, genomic islands are predominantly found in the terminus region.
- The distance distribution between genomic islands follows an exponential pattern, indicating preferred locations.

## Abstract

Horizontal gene transfer (HGT) is a powerful evolutionary force that considerably shapes the structure of prokaryotic genomes and is associated with genomic islands (GIs). A GI is a DNA segment composed of transferred genes that can be found within a prokaryotic genome, obtained through HGT. Much research has focused on detecting GIs in genomes, but here we pursue a new course, which is identifying possible preferred locations of GIs in the prokaryotic genome. Here, we identify the locations of the GIs within prokaryotic genomes to examine patterns in those locations. Prokaryotic GIs were analyzed according to the genome structure that they are located in, whether it be a circular or a linear genome. The analytical investigations employed are: (1) studying the GI locations in relation to the origin of replication (oriC); (2) exploring the distances between GIs; and (3) determining the distribution of GIs across the genomes. For each of the investigations, the analysis was performed on all of the GIs in the data set. Moreover, to void bias caused by the distribution of the genomes represented, the GIs in one genome from each species and the GIs of the most frequent species are also analyzed. Overall, the results showed that there are preferred sites for the GIs in the genome. In the linear genomes, these sites are usually located in the oriC region and terminus region, while in the circular genomes, they are located solely in the terminus region. These results also showed that the distance distribution between the GIs is almost exponential, which proves that GIs have preferred sites within genomes. The oriC and termniuns are preferred sites for the GIs and a possible natural explanation for this could be connected to the content of the oriC region. Moreover, the content of the GIs in terms of its protein families was studied and the results demonstrated that the majority of frequent protein families are close to identical in each section.

## Full-text entities

- **Diseases:** Klebsiella pneumoniae (MESH:D007710), GIs (MESH:D007516)
- **Species:** Bordetella pertussis (species) [taxon 520], Acidovorax sp. (species) [taxon 1872122], Acidovorax sp. JS42 (species) [taxon 232721], Salmonella enterica (species) [taxon 28901], Escherichia coli (E. coli, species) [taxon 562], Pseudomonas aeruginosa (species) [taxon 287], Klebsiella pneumoniae (species) [taxon 573]

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11065298/full.md

---
Source: https://tomesphere.com/paper/PMC11065298