# Integration of bioinformatic tools for the detection of SARS-CoV-2 co-infection cases

**Authors:** Adeliza Mae L. Realingo, Francisco Gerardo M. Polotan, Miguel Francisco B. Abulencia, Roslind Anne R. Pantoni, Jessel Babe G. Capin, Gerald Ivan Sotelo, Maria Carmen A. Corpuz, Neil Tristan M. Yabut, Saul M. Rojas, Ma. Angelica Tujan, Karen Iana Tomas, Ardiane Ysabelle Dolor, Czarina Christelle Alyannah Celis, Stephen Paul Ortia, Ezekiel A. Melo, Chelsea Mae M. Reyes, Elijah Miguel P. Flores, Anne Pauline A. Alpino, Aldwin Kim A. Penales, Kathlene Mae C. Medina, Joanna Ina Manalo, Timothy John R. Dizon, Katie Hampson, Sandeep Kasaragod, Joseph Hughes, Kirstyn Brunker

PMC · DOI: 10.1099/mgen.0.001604 · 2026-01-29

## TL;DR

This paper describes a bioinformatics pipeline that detects SARS-CoV-2 co-infections and variant recombination using genomic data from patient samples.

## Contribution

The study introduces a novel bioinformatics pipeline called Katmon to identify co-infections and recombinant variants in SARS-CoV-2.

## Key findings

- Two probable co-infection cases were identified using the Katmon pipeline, involving Delta and Omicron variants.
- A retrospective analysis of 1,078 samples revealed a lower bound co-infection prevalence of 0.27% and 0.19%.
- The pipeline detected recombinant variants and confirmed co-infections from multiple countries.

## Abstract

Co-infection with multiple severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants, though rare, may have clinical and public health implications, including facilitating variant recombination. Early detection of co-infections is, therefore, crucial. In this study, we report two probable cases of co-infection identified during routine genomic surveillance. Initially suspected as cross-contamination due to the presence of private mutations and nucleotide mixtures flagged by Nextclade and bammix, the samples were re-extracted and re-sequenced after workspace decontamination, yet the anomalies persisted. To investigate further, we developed a bioinformatics pipeline (Katmon) incorporating various tools such as Freyja, with lineage abundance results that illustrated the presence of multiple variants, and VirStrain, which confirmed inconsistent lineage assignments. We also visualized the alternative allele fractions for each lineage-defining mutation and amplicon, showing evidence of two variants, Delta and Omicron, co-existing within a single amplicon. Amplicon sorting effectively separated reads corresponding to the two variants, and the resulting consensus sequences aligned with their respective lineage assignments. These findings suggest that the first sample, PH-RITM-1395, involved a Delta–Omicron co-infection, while the second sample, PH-RITM-4146, probably contains both a co-infection and a recombinant variant. To further support the second sample’s recombinant nature, we employed sc2rf, which identified Delta–Omicron breakpoints. Retrospective analysis of 1,078 samples from July 2021 to July 2022, encompassing the period of co-circulation of different variants in the Philippines, flagged four additional co-infection cases, including Delta–Omicron and Beta–Omicron, suggesting a lower bound co-infection prevalence of 0.27% and 0.19%, respectively. Furthermore, the pipeline was used to test previously identified co-infections of different variants from different countries. Our findings underscore the critical importance of real-time genomic surveillance and advanced bioinformatics pipelines in detecting SARS-CoV-2 co-infections and variant recombination.

## Linked entities

- **Diseases:** SARS-CoV-2 (MONDO:0100096)

## Full-text entities

- **Genes:** GGH (gamma-glutamyl hydrolase) [NCBI Gene 8836] {aka GATD10, GH}, PAH (phenylalanine hydroxylase) [NCBI Gene 5053] {aka PH, PKU, PKU1}, S (surface glycoprotein) [NCBI Gene 43740568] {aka spike glycoprotein}, N (nucleocapsid phosphoprotein) [NCBI Gene 43740575], E (envelope protein) [NCBI Gene 43740570], ORF1ab (ORF1a polyprotein;ORF1ab polyprotein) [NCBI Gene 43740578]
- **Diseases:** infectious diseases (MESH:D003141), PGC (MESH:D019595), AAF (MESH:C536589), NEC (MESH:C536209), Co-infection (MESH:D060085), -infections (MESH:D007239), NCR (MESH:D060048), VOC (MESH:D008881), COVID-19 (MESH:D000086382), severe acute respiratory syndrome 2 (MESH:D045169), GISAID (MESH:D007251), chronic (MESH:D002908), PH-RITM-1395 (MESH:D014947), death (MESH:D003643)
- **Chemicals:** PH-RITM-1395 (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Zeta (genus) [taxon 743421], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]
- **Mutations:** T22882G, D614G, G22992A, S477N, T478K, T22200G, T22917G, V213G, C23604G, K417 N, N679K, G142D, P314L, T23599G, P681R

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12856159/full.md

---
Source: https://tomesphere.com/paper/PMC12856159