# GenomicGapID: leveraging spatial distribution of conserved genomic sites for broad-spectrum microbial identification

**Authors:** Vishwaratn Asthana, Pallavi Bugga, Clara Elaine Smith, Catherine Wellman, Zachary Dwight, Piyush Ranjan, Erika Martínez Nieves, Robert P. Dickson, J. Scott VanEpps

PMC · DOI: 10.1128/spectrum.02817-24 · Microbiology Spectrum · 2025-03-14

## TL;DR

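GenomicGapID is a new method for identifying bacteria that combines the broad taxonomic scope of sequencing with the speed of PCR: universal primers bind conserved genomic regions and amplify the variable gaps between them, producing a species-specific signature.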

## Contribution

GenomicGapID introduces a novel approach that uses the non-conserved gaps between conserved genomic regions to enable rapid, target-agnostic microbial identification with universal primers.

## Key findings

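- Three universal primer sets, chosen under the GenomicGapID framework, are sufficient to cover 189 clinical bacterial pathogens; a subset of these was validated experimentally.
- Amplicon size signatures alone would require an impractical number of universal primer sets; concurrent melt analysis mitigates this.
- The 189 species represent a majority of the microbes identified in positive cultures in a clinical microbiology setting.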
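To illustrate why adding melt analysis reduces the number of primer sets needed, here is a minimal sketch that groups species by signature and reports which ones remain indistinguishable. The species names, amplicon sizes, and melt temperatures are hypothetical placeholders, not values from the paper, and `group_by_signature` and `unresolved` are illustrative helpers rather than the authors' method.

```python
from collections import defaultdict
from typing import Dict, Hashable, List


def group_by_signature(signatures: Dict[str, Hashable]) -> Dict[Hashable, List[str]]:
    """Map each distinct signature to the species that share it."""
    groups = defaultdict(list)
    for species, signature in signatures.items():
        groups[signature].append(species)
    return dict(groups)


def unresolved(signatures: Dict[str, Hashable]) -> List[List[str]]:
    """Return groups of species that share a signature and so cannot be told apart."""
    groups = group_by_signature(signatures)
    return sorted(sorted(members) for members in groups.values() if len(members) > 1)


# Hypothetical amplicon sizes (bp) from one universal primer set.
size_sig = {
    "species_A": (310, 450),
    "species_B": (310, 450),
    "species_C": (520,),
}

# Hypothetical melt temperatures (deg C) for the same reactions.
melt_sig = {"species_A": 84.5, "species_B": 87.0, "species_C": 84.5}

# Combined signature: amplicon sizes plus melt temperature.
combined = {sp: (size_sig[sp], melt_sig[sp]) for sp in size_sig}

print(unresolved(size_sig))   # [['species_A', 'species_B']]
print(unresolved(combined))   # []
```

In this toy case, size alone leaves two species unresolved, while size plus melt temperature separates all three. Real measurements would need tolerance-based binning of both sizes and melt temperatures instead of exact matching.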

## Abstract

Bacterial detection and identification methods can be broadly classified as either untargeted with expansive taxonomic coverage or targeted with narrow taxonomic focus. Untargeted approaches, such as culture and sequencing, are often time-consuming and/or costly, whereas targeted methods, such as PCR, can offer faster and more cost-effective results but require a priori knowledge of the likely pathogen to select the appropriate assay. GenomicGapID, a novel approach that leverages the spatial distribution of conserved genetic regions across microbial genomes, represents a significant advancement in the field of microbial identification. This technique has the potential to provide the taxonomic breadth of culture and sequencing while maintaining the speed, simplicity, and cost-effectiveness of PCR. By leveraging the conservation and relative positioning of highly conserved coding regions across different species, GenomicGapID enables the development of universal primer sets that amplify the non-conserved gaps between these regions. This creates a unique electrophoretic signature that facilitates rapid and accurate target-agnostic microbial identification.

In this study, we apply the principles of GenomicGapID to the critical task of identifying clinical pathogens. We focus on expanding the coverage of a previously developed universal bacterial identification system, which initially targeted the 16S–23S internal transcribed spacer (ITS) region and was capable of discerning 45 pathogens. To enhance this system, we assembled a comprehensive database of 189 clinically relevant bacterial species. We then identified conserved primer binding sites that produce unique amplicon size signatures for each species. While we found that the use of amplicon size signatures alone would require an impractical number of universal primer sets, we demonstrate that this challenge can be effectively mitigated through concurrent melt analysis. Ultimately, we show that just three universal primer sets, guided by the GenomicGapID framework, are sufficient to cover 189 clinical bacterial pathogens, representing a majority of microbes identified in positive cultures in a clinical microbiology setting, with experimental validation of a subset of these pathogens. This study not only enhances the existing universal bacterial identification system but also establishes GenomicGapID as a versatile and powerful tool in microbial diagnostics and beyond, paving the way for new avenues of research in genomics with the potential to advance molecular biology, clinical practice, and public health.
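The idea of a universal primer pair producing an amplicon size signature can be sketched as a toy in silico PCR. This is an illustrative simplification, not the authors' pipeline: it requires exact primer matches, searches one strand only, and pairs each forward site with the nearest downstream reverse site. The primer sequences, genomes, and the `amplicon_sizes` helper are all hypothetical.

```python
from typing import List, Tuple

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def find_all(haystack: str, needle: str) -> List[int]:
    """Return start positions of every (possibly overlapping) exact match."""
    positions = []
    start = haystack.find(needle)
    while start != -1:
        positions.append(start)
        start = haystack.find(needle, start + 1)
    return positions


def amplicon_sizes(genome: str, fwd: str, rev: str, max_len: int = 3000) -> Tuple[int, ...]:
    """Sorted product sizes, pairing each forward site with the nearest downstream reverse site."""
    rev_site = reverse_complement(rev)  # how the reverse primer site reads on the top strand
    rev_positions = find_all(genome, rev_site)
    sizes = []
    for f in find_all(genome, fwd):
        downstream = [r for r in rev_positions if r >= f + len(fwd)]
        if downstream:
            size = downstream[0] + len(rev_site) - f
            if size <= max_len:
                sizes.append(size)
    return tuple(sorted(sizes))


# Hypothetical "universal" primers and toy genomes with variable gaps between conserved sites.
FWD, REV = "GATTACAG", "CCTGAAGT"
RC = reverse_complement(REV)

genome_a = "TT" + FWD + "T" * 10 + RC + "CC" + FWD + "T" * 25 + RC + "GG"
genome_b = "CC" + FWD + "T" * 10 + RC + "AA" + FWD + "T" * 25 + RC
genome_c = "AA" + FWD + "T" * 18 + RC + "CC"

print(amplicon_sizes(genome_a, FWD, REV))  # (26, 41)
print(amplicon_sizes(genome_b, FWD, REV))  # (26, 41)
print(amplicon_sizes(genome_c, FWD, REV))  # (34,)
```

Here `genome_a` and `genome_b` share the same size signature, which mirrors the abstract's point that size alone may not separate every species and that a second readout such as melt analysis is needed.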

Rapid and accurate microbial identification is critical in both clinical and research settings. Traditional untargeted methods, such as culture and sequencing, are often time-consuming and expensive, while targeted techniques like PCR offer speed and cost-effectiveness but require pre-selection of pathogens. Our work introduces GenomicGapID, a novel bacterial identification system that provides the taxonomic breadth of untargeted methods, coupled with the speed, simplicity, and affordability of targeted PCR-based techniques. By leveraging the gap between conserved genetic regions and analyzing the associated unique electrophoretic and melt analysis signatures, GenomicGapID enables target-agnostic bacterial identification using a parsimonious set of universal primers.

Our work has significant implications not only in clinical microbiology but also in genomics, environmental microbiology, and public health. We believe this manuscript aligns well with the mission of Microbiology Spectrum to publish innovative and impactful research that advances the field of microbial sciences.

## Full-text entities

- **Diseases:** Bacterial pathogen (MESH:D001424)
- **Chemicals:** Lysogeny broth (-), polyacrylamide (MESH:C016679), Na+ (MESH:D012964), AT (MESH:D001246), PBS (MESH:D007854), DMSO (MESH:D004121), glucose (MESH:D005947), salt (MESH:D012492), TE (MESH:D013691), SYTO 9 (MESH:C103389)
- **Species:** Homo sapiens (human, species) [taxon 9606], Borreliella burgdorferi (Lyme disease spirochete, species) [taxon 139], Campylobacter jejuni (species) [taxon 197], Leptospira interrogans serovar Copenhageni (no rank) [taxon 44275], Leptospira interrogans (species) [taxon 173], Escherichia coli (E. coli, species) [taxon 562], Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395], Enterococcus faecium (species) [taxon 1352]
- **Cell lines:** ATCC 25922 — Homo sapiens (Human), Finite cell line (CVCL_LK64)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12054053/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12054053/full.md

## References

20 references — full list in the complete paper: https://tomesphere.com/paper/PMC12054053/full.md

---
Source: https://tomesphere.com/paper/PMC12054053