# Exploring the IS-capades of Klebsiella pneumoniae: insertion sequences drive metabolic loss in obscure sub-lineages

**Authors:** Ben Vezina, Claire White, Helena B. Cooper, Kathryn E. Holt, Jane Hawkey, Kelly L. Wyres, Margaret M. C. Lam

PMC · DOI: 10.1099/mgen.0.001612 · 2026-01-22

## TL;DR

This study shows that Klebsiella pneumoniae sub-lineages with high insertion sequence (IS) loads have reduced metabolic capabilities, suggesting IS elements drive metabolic loss during evolution.

## Contribution

The study reveals a novel inverse relationship between IS load and metabolic capacity in specific K. pneumoniae sub-lineages.

## Key findings

- High IS loads in four K. pneumoniae sub-lineages correlate with reduced metabolic gene counts and substrate utilization.
- IS insertion sites are frequently found near genes involved in transcription, carbohydrate, and amino acid metabolism.
- Sub-lineages exhibit convergent losses in phosphorus and carbon source utilization despite maintaining broad metabolic potential.

## Abstract

Introduction.
Klebsiella pneumoniae is an opportunistic pathogen that causes a wide spectrum of infections within healthcare settings and the community. Four K. pneumoniae sub-lineages, defined using core gene multi-locus sequence types, are known to cause distinct infections of the nasal and/or upper respiratory passages: SL91 and SL10031 (also referred to as subspecies ozaenae), SL10032 (subspecies rhinoscleromatis) and SL82. These sub-lineages have also demonstrated reduced carbon source utilization, which, in other species, has been linked with high loads of insertion sequences (ISs).

Methods. We performed comparative genomics, analysed IS composition and loads and constructed genome-scale metabolic models for available public sequences from these four sub-lineages. These were then compared with other sub-lineages from the wider K. pneumoniae population.

Results. The four focal sub-lineages displayed significantly higher IS loads (median range, 88–120 per genome) than other K. pneumoniae sub-lineages (median range, 12–73). Notably, each K. pneumoniae sub-lineage had unique IS profiles, consistent with distinct evolutionary trajectories of IS acquisition and expansion. Across sub-lineages, higher IS loads were inversely associated with the number of metabolic model genes per genome (R2=0.16; P<0.001), as well as predicted aerobic substrate utilization for phosphorus sources (R2=0.39; P<0.001), as per a second-degree polynomial regression model (n=1,664 genomes). Additionally, the four IS-dense sub-lineages displayed a combination of convergent, sub-lineage-specific substrate utilization losses, including the parallel loss of 3-phospho-d-glycerate, d-glycerate-2-phosphate and phosphoenolpyruvate utilization as carbon/phosphorus sources. Finally, inspection of IS insertion sites demonstrated frequent and non-destructive insertion next to transcriptional, carbohydrate and amino acid metabolism genes.

Conclusions. IS accumulation in K. pneumoniae was significantly associated with reduced metabolic substrate usage, consistent with an inverse relationship between IS load and metabolic capacity. Despite these losses, the affected lineages still demonstrate substantial metabolic breadth, consistent with early-stage, ongoing reductive evolution.

## Linked entities

- **Species:** Klebsiella pneumoniae (taxon 573)

## Full-text entities

- **Diseases:** granulomatous disease (MESH:D006105), urinary tract infections (MESH:D014552), infection (MESH:D007239), atrophic rhinitis (MESH:D012222), rhinoscleroma (MESH:D012226), Klebsiella pneumoniae (MESH:D007710), K. pneumoniae (MESH:D011014), inflammatory (MESH:D007249), respiratory infections (MESH:D012141), IS (MESH:C538388)
- **Chemicals:** Salmochelin (MESH:C000630262), cytosine (MESH:D003596), 3-phospho-d-glycerate (-), 3-hydroxyphenylacetic acid (MESH:C008069), putrescine (MESH:D011700), 2-ketoglutarate (MESH:D007656), phosphorus (MESH:D010758), phosphoenolpyruvate (MESH:D010728), iron (MESH:D007501), sulphur (MESH:D013455), nitrogen (MESH:D009584), amino acid (MESH:D000596), myo-inositol hexakisphosphate (MESH:D010833), aerobactin (MESH:C031819), pyruvate (MESH:D019289), d-glucose (MESH:D005947), 4-hydroxybenzoate (MESH:C038193), quinate (MESH:D011801), acetolactate (MESH:C006359), l-histidine (MESH:D006639), ilvIN (MESH:D001977), pentose phosphate (MESH:D010428), yersiniabactin (MESH:C104398), carbon (MESH:D002244), l-tyrosine (MESH:D014443), l-lysine (MESH:D008239), carbohydrate (MESH:D002241), ethanolamine (MESH:D019856)
- **Species:** Gallus gallus (bantam, species) [taxon 9031], Shigella (genus) [taxon 620], Bordetella pertussis (species) [taxon 520], Homo sapiens (human, species) [taxon 9606], Klebsiella pneumoniae (species) [taxon 573], Bos taurus (bovine, species) [taxon 9913]
- **Cell lines:** SL66 — Mus musculus (Mouse), Malignant neoplasms of the mouse mammary gland, Cancer cell line (CVCL_9722), -1 — Mus musculus (Mouse), Hybridoma (CVCL_C7RB), SL82 — Homo sapiens (Human), Ewing sarcoma, Cancer cell line (CVCL_LK91), SL10032 — Homo sapiens (Human), Arteriovenous hemangioma/malformation, Transformed cell line (CVCL_HR61)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12828178/full.md

---
Source: https://tomesphere.com/paper/PMC12828178