# Identification of robust species-specific marker genes for the demarcation of the Klebsiella pneumoniae–quasipneumoniae–variicola complex

**Authors:** Mustafa Galib, Viet Hung Nguyen

PMC · DOI: 10.1099/jmm.0.002102 · Journal of Medical Microbiology · 2025-11-17

## TL;DR

This study identifies specific genetic markers to accurately and affordably distinguish between closely related Klebsiella species important for clinical diagnosis and treatment.

## Contribution

The paper introduces a panel of species-specific marker genes for rapid and cost-effective differentiation of Klebsiella species.

## Key findings

- GTDB is confirmed as the most accurate taxonomic reference for Klebsiella species classification.
- 22 candidate species-specific marker genes were identified, with four showing high specificity.
- ATP-binding proteins are valuable in multi-locus marker panels despite conserved domains.

## Abstract

Introduction.
Klebsiella pneumoniae, Klebsiella quasipneumoniae and Klebsiella variicola are closely related species (collectively referred to as ‘PQV’) but exhibit distinct clinical presentations and epidemiological profiles. These distinctions are important for proper diagnosis and treatment of infections and other Klebsiella-related diseases. The ability to differentiate between these species is becoming more important as we better understand their unique roles in disease.

Hypothesis/Gap statement. Despite the importance of accurate taxonomic classifications, both traditional and modern laboratory techniques fail to accurately delineate between the PQV species, which can lead to treatment failures. Conversely, taxonomy via whole-genome sequencing is highly accurate but is both costly and resource-intensive. Thus, there is a lack of a widely adopted method that balances both cost-effectiveness and accuracy for differentiating PQV species.

Aim. To assess the taxonomic accuracy of existing genome databases and classification tools for PQV species and to develop a rapid, cost-effective method for accurate species differentiation.

Methodology. To address these challenges, we extracted 78 representative PQV genomes from the Integrated Microbial Genomes and Microbiomes database. We used multiple comparative genomic comparison techniques, phylogenetic tree constructions and pangenome profiling to accurately phylogenetically and subsequently taxonomically classify the genomes. After establishing accurate taxonomic classifications, we identified species-specific marker genes (SSMGs) represented by KEGG Orthologies (KOs).

Results. The Genome Taxonomy Database (GTDB) classifications were consistent with our comparative genomic benchmarks, while the National Center for Biotechnology Informationdatabase originally contained six misclassified genomes. The GTDB-Tk tool also showed reliability for systematic classification of the genomes. Our KO-based screening identified 22 candidate SSMGs. Four markers (K05306, K07507, K13795 and K09955) exhibited significant specificity. ATP-binding proteins showed slightly higher maximum percentage identity values due to conserved domains but were still valuable within multi-locus SSMG panels.

Conclusion. Our findings establish the GTDB as the gold standard taxonomic reference for PQV classification when complete genomes are available. Additionally, we developed a practical panel of genetic markers that enables rapid, cost-effective and accurate species differentiation. These SSMGs represent practical tools that can be implemented in diagnostic laboratories for both clinical specimens and environmental surveillance, addressing a critical gap in clinical microbiology.

## Linked entities

- **Species:** Klebsiella pneumoniae (taxon 573), Klebsiella quasipneumoniae (taxon 1463165), Klebsiella variicola (taxon 244366)

## Full-text entities

- **Diseases:** infections (MESH:D007239)
- **Chemicals:** PQV (-)
- **Species:** Klebsiella variicola (species) [taxon 244366], Klebsiella pneumoniae (species) [taxon 573], Klebsiella quasipneumoniae (species) [taxon 1463165]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12623006/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12623006/full.md

## References

107 references — full list in the complete paper: https://tomesphere.com/paper/PMC12623006/full.md

---
Source: https://tomesphere.com/paper/PMC12623006