# Phage characterization analysis in respiratory samples from infected patients based on metagenomic next-generation sequencing

**Authors:** Yuyan Huang, Qingqing Cai, Yingying Chen, Direerba Amutijiang, Yihan Lu, Weifeng Huang, Ling Li

PMC · DOI: 10.3389/fcimb.2026.1779296 · Frontiers in Cellular and Infection Microbiology · 2026-03-12

## TL;DR

This study uses metagenomic sequencing to analyze phage communities in respiratory samples from infected patients, revealing differences in phage diversity and composition linked to patient outcomes and ICU status.

## Contribution

The study identifies distinct phage community characteristics in respiratory samples and their association with clinical outcomes and ICU admission.

## Key findings

- Sputum and BALF samples showed similar phage richness but distinct community structures.
- ICU patients had reduced phage diversity and altered community structures.
- Phage composition in sputum was significantly associated with patient clinical outcomes.

## Abstract

Respiratory tract infections are common infectious diseases, with microbial dysbiosis closely linked to clinical outcomes in the host. As key regulators of bacteria, phages can influence the structure and stability of microbial communities by infecting host bacteria. Metagenomic next-generation sequencing (mNGS) enables comprehensive analysis of phage community characteristics in clinical samples.

This study included 6,404 clinical samples, comprising 4,837 bronchoalveolar lavage fluids (BALF) and 1,567 sputum samples, for metagenomic next-generation sequencing (mNGS), while collecting patient demographics, sample types, mNGS results, and clinical outcomes. Host-derived sequences were removed post-sequencing and aligned against viral reference databases. Phage community structures across sample types were assessed using alpha and beta diversity metrics. Spearman correlation analysis explored associations between phages and bacteria. Further bioinformatics analysis was performed on 194 samples, including viral sequence assembly and identification using SPAdes, VirSorter2, and PhaMer; CD-HIT clustering and redundancy removal; CheckV quality assessment; PhaTYP lifestyle prediction; Prodigal protein gene annotation; and BLASTP alignment against the CARD database to screen for phage resistance genes.

The sputum and BALF groups exhibited comparable richness, diversity, and evenness, yet their community structures differed significantly. Intensive Care Unit (ICU) admission status was closely associated with reduced phage community diversity and significant alterations in community structure, and the abundance distribution of several phage families (Peduoviridae, Autoscriptoviridae, Casjensviridae, Demerecviridae) also changed significantly. Additionally, the phage community structure in sputum samples was significantly associated with patient clinical outcomes. Correlation analysis demonstrated that the Aliceevansviridae family in sputum samples had extensive positive associations with various bacteria. After assembly, 69.5% of pOTUs were predicted to be temperate phages, and 28.9% were predicted to be virulent phages; moreover, the vast majority (99.2%) of phage sequences showed low similarity to antibiotic resistance genes.

This study identifies distinct phage community characteristics across respiratory sample types and reveals that ICU patients exhibit reduced phage diversity and markedly altered community structures. Furthermore, the phage composition in upper respiratory tract samples shows a clear relationship with patient prognosis, providing new insights into respiratory infection microecology.

## Full-text entities

- **Diseases:** infectious diseases (MESH:D003141), infected (MESH:D007239), Respiratory tract infections (MESH:D012141)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13017799/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13017799/full.md

## References

94 references — full list in the complete paper: https://tomesphere.com/paper/PMC13017799/full.md

---
Source: https://tomesphere.com/paper/PMC13017799