# Large-scale genomic analysis of jumbo phages: coevolution, genome architecture, and host interaction mechanisms

**Authors:** Chao Wei, Zhe Chen, Yaxiang Wang, Lusheng Huang, Congying Chen

PMC · DOI: 10.1186/s42523-026-00534-z · 2026-02-24

## TL;DR

This study explores jumbo phages, large viruses with diverse genomes, and their interactions with hosts and environments using a large genomic dataset.

## Contribution

The study expands the known diversity of jumbo phages and reveals coevolution patterns and host interactions through large-scale genomic analysis.

## Key findings

- 10,754 jumbo phage genomes were identified, with 34.69% representing potentially novel species.
- Jumbo phages coevolve with their habitats and use alternative genetic codes with suppressor tRNAs.
- CRISPR analysis identified potential hosts and competitive interactions among jumbo phages.

## Abstract

Jumbo phages are phages with comparatively large genome sizes. Jumbo phages have been identified in various microbial communities. However, their diversity, genome structure, potential function, and their interactions with hosts and other phages are largely unknown due to insufficient genomic data.

We collected 59,652,008 putative viral genomes from seven habitats by using 38 public metagenome datasets, an integrated public viral genome database (IGN), and pig gut viral genome databases. We obtained 10,754 jumbo phage genomes with sizes ranging from 200 to 831 kb. Most (94.64%) of these jumbo phage genomes were classified into Caudoviricetes, and the results have expanded the known diversity of Caudoviricetes. We found 2,389 species-like operational genome clusters that contained 3,727 (34.69%) genomes without any known viral genomes in the IGN, suggesting potential novel species-like genomes. Genome analysis suggested the potential coevolution of jumbo phages with habitat types and highlighted the utilization of alternative genetic codes and their corresponding suppressor tRNAs for recoding stop codons. CRISPR spacer analysis revealed potential bacterial or archaeal hosts of jumbo phages and uncovered competitive networks among jumbo phages. Habitat type had an important effect on the variation in phage auxiliary metabolic genes.

This study provides an important resource and new knowledge for future studies on the interaction between jumbo phages and their bacterial or archaeal hosts.

The online version contains supplementary material available at 10.1186/s42523-026-00534-z.

## Full-text entities

- **Genes:** TECRL (trans-2,3-enoyl-CoA reductase like) [NCBI Gene 253017] {aka CPVT3, GPSN2L, SRD5A2L2, TERL}, CLPP (caseinolytic mitochondrial matrix peptidase proteolytic subunit) [NCBI Gene 100737271], GLUL (glutamate-ammonia ligase) [NCBI Gene 396944]
- **Diseases:** cholera (MESH:D002771), scarlet fever (MESH:D012541), botulism (MESH:D001906), dysentery (MESH:D004403), VF (MESH:C537182), diphtheria (MESH:D004165), infection (MESH:D007239), AMGs (MESH:D008659), staphylococcal exfoliative skin syndrome (MESH:D013207)
- **Chemicals:** S-adenosyl-L-methionine (MESH:D012436), Nucleotide (MESH:D009711), glycan (MESH:D011134), nitrogen (MESH:D009584), beta-lactam (MESH:D047090), lipid (MESH:D008055), Glycine (MESH:D005998), carbohydrate (MESH:D002241), sulfur (MESH:D013455), cysteine (MESH:D003545), glutamine (MESH:D005973), penicillin (MESH:D010406), methionine (MESH:D008715), carbon (MESH:D002244), S-ribosyl-L-homocysteine (MESH:C487472), sulfate (MESH:D013431), cephalosporins (MESH:D002511), NAD+ (MESH:D009243), S-adenosyIL-homocysteine (-), methane (MESH:D008697)
- **Species:** Streptococcus pneumoniae (species) [taxon 1313], Salmonella enterica (species) [taxon 28901], Porphyromonas gingivalis (species) [taxon 837], Prevotella (genus) [taxon 838], Sus scrofa (pig, species) [taxon 9823], Homo sapiens (human, species) [taxon 9606], Staphylococcus aureus (species) [taxon 1280]
- **Mutations:** glutamine from glutamate
- **Cell lines:** SWFM04 — Homo sapiens (Human), Melanoma, Cancer cell line (CVCL_S856)

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13037316/full.md

---
Source: https://tomesphere.com/paper/PMC13037316