# BEREN: a bioinformatic tool for recovering giant viruses, polinton-like viruses, and virophages in metagenomic data

**Authors:** Benjamin Minch, Mohammad Moniruzzaman

PMC · DOI: 10.1093/bioadv/vbaf284 · Bioinformatics Advances · 2025-11-08

## TL;DR

BEREN is a new bioinformatic tool designed to recover and analyze giant viruses and related viruses in metagenomic data.

## Contribution

The novel contribution is BEREN, a comprehensive tool specifically optimized for recovering NCLDV and Preplasmiviricota viruses from metagenomes.

## Key findings

- BEREN outperformed existing tools in recovering NCLDV contigs and Preplasmiviricota genomes from a mock metagenome.
- The tool includes modules for genome recovery, marker gene detection, and metabolic protein annotation.
- BEREN provides a user-friendly solution for studying the ecological roles of eukaryotic viruses.

## Abstract

Viruses in the kingdom Bamfordvirae, specifically giant viruses (NCLDVs) in the phylum Nucleocytoviricota and smaller members in the Preplasmiviricota phylum, are widespread and important groups of viruses that infect eukaryotes. While viruses in this kingdom, such as giant viruses, polinton-like viruses, and virophages, have gained large interest from researchers in recent years, there is still a lack of streamlined tools for the recovery of their genomes from metagenomic datasets.

Here, we present, BEREN, a comprehensive bioinformatic tool to unlock the diversity of these viruses in metagenomes through five modules for NCLDV genome, contig, and marker gene recovery, metabolic protein annotation, and Preplasmiviricota genome identification and annotation. BEREN’s performance was benchmarked against other mainstream virus recovery tools using a mock metagenome, demonstrating superior recovery rates of NCLDV contigs and Preplasmiviricota genomes. Overall, BEREN offers a user-friendly, transparent bioinformatic solution for studying the ecological and functional roles of these eukaryotic viruses, facilitating broader access to their metagenomic analysis.

BEREN is available at https://gitlab.com/benminch1/BEREN, and results from testing BEREN on a real-world metagenome are available in the Supplementary Files.

## Full-text entities

- **Genes:** POLB (DNA polymerase beta) [NCBI Gene 100526021], CD46 (CD46 molecule, complement regulatory protein) [NCBI Gene 396922] {aka MCP}
- **Chemicals:** Preplasmiviricota (-)
- **Species:** Sus scrofa (pig, species) [taxon 9823], Phaeocystis globosa (species) [taxon 33658], Emiliania huxleyi (species) [taxon 2903], Pseudomonas hunanensis (species) [taxon 1247546], Viruses (acellular root) [taxon 10239], PX clade (clade) [taxon 569578]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12638062/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12638062/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC12638062/full.md

---
Source: https://tomesphere.com/paper/PMC12638062