# VirDiG: a de novo transcriptome assembler for coronavirus

**Authors:** Minghao Li, Xiaoyu Guo, Jin Zhao

PMC · DOI: 10.1093/bioadv/vbaf075 · Bioinformatics Advances · 2025-04-08

## TL;DR

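VirDiG is a de novo assembler that reconstructs coronavirus transcriptomes, meaning the genomic RNA and the subgenomic mRNAs, from short reads. Accurate transcript assembly supports the study of the virus's biology and the development of prevention and treatment strategies.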

## Contribution

VirDiG is a de novo transcriptome assembler designed specifically for coronaviruses. It builds a discontinuous graph and uses paired-end reads, sequence depth, and start and stop codons to guide transcript assembly.

## Key findings

- On both simulated and real datasets, VirDiG reconstructs coronavirus transcriptomes more accurately than de novo assemblers designed for eukaryotic transcriptomes.
- The assembly combines three sources of evidence: paired-end reads, sequence depth, and start and stop codons.

## Abstract

The discontinuous transcription mechanism of coronaviruses contributes to their adaptation to different host environments and plays a critical role in their lifecycle. Accurate assembly of coronavirus transcripts is vital for understanding the virus’s biological traits and developing precise prevention and treatment strategies. However, existing de novo assembly algorithms are primarily designed for alternative splicing events in eukaryotes and are not suitable for assembling the coronavirus transcriptome, which consists of both genomic RNA and subgenomic mRNAs. Reconstructing the coronavirus transcriptome from short reads therefore remains a challenging problem.
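To make the structure of such a transcriptome concrete, here is a minimal toy sketch in Python. It assumes the commonly described model in which each subgenomic mRNA joins a shared 5' leader to a downstream body segment that runs to the 3' end of the genome. The function name `subgenomic_mrnas`, the toy genome, and all coordinates are hypothetical and are not taken from the paper.

```python
def subgenomic_mrnas(genome, leader_end, body_starts):
    """Return the genomic RNA followed by one leader-body fusion per body start.

    genome      : full-length genomic sequence (toy string here)
    leader_end  : exclusive end index of the shared 5' leader (assumed known)
    body_starts : indices where a body segment begins (hypothetical junctions)
    """
    transcripts = [genome]  # the genomic RNA itself is part of the transcriptome
    for start in sorted(body_starts):
        if start <= leader_end or start >= len(genome):
            raise ValueError(f"body start {start} must lie downstream of the leader")
        # Discontinuous join: skip everything between the leader and the body.
        transcripts.append(genome[:leader_end] + genome[start:])
    return transcripts


# Toy genome: a 4-letter leader followed by three 4-letter "genes".
toy_genome = "LLLL" + "aaaa" + "bbbb" + "cccc"
toy_transcripts = subgenomic_mrnas(toy_genome, leader_end=4, body_starts=[8, 12])
for t in toy_transcripts:
    print(t)
```

Every transcript in the toy output shares the same prefix and the same suffix. That nesting is one reason assemblers built around eukaryotic splicing patterns may struggle to separate the individual transcripts.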

In this work, we present VirDiG, a de novo transcriptome assembler specifically designed for coronaviruses. VirDiG utilizes a discontinuous graph to facilitate accurate transcript assembly by incorporating information from paired-end reads, sequence depth, and start and stop codons. Experimental results from both simulated and real datasets show that VirDiG exhibits significant advantages in reconstructing the transcriptome of coronaviruses when compared to traditional de novo assemblers tailored for classical eukaryotic transcriptome assembly.
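The abstract names the evidence VirDiG uses but does not describe the algorithm, so the following Python sketch is only an illustration of how read support and start/stop codons could be combined to filter candidate transcripts. It is not VirDiG's implementation. The helper names (`first_orf`, `candidate_transcripts`), the `min_support` threshold, the use of a single support count per junction, and the toy sequence are all assumptions.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_orf(seq, min_codons=2):
    """Return (start, end) of the first ATG..stop ORF, or None if there is none.

    min_codons counts the codons before the stop codon, including the ATG.
    """
    seq = seq.upper()
    for start in range(len(seq) - 2):
        if seq[start:start + 3] != "ATG":
            continue
        # Walk in frame from the codon after the ATG until a stop codon appears.
        for pos in range(start + 3, len(seq) - 2, 3):
            if seq[pos:pos + 3] in STOP_CODONS:
                if (pos - start) // 3 >= min_codons:
                    return start, pos + 3
                break  # ORF too short; try the next ATG
    return None


def candidate_transcripts(genome, leader_end, junction_support, min_support=2):
    """Keep leader-body joins that have enough read support and contain an ORF.

    junction_support maps a hypothetical body start index to the number of
    reads (or read pairs) spanning that leader-body junction.
    """
    kept = {}
    for body_start, support in sorted(junction_support.items()):
        if support < min_support:
            continue  # weakly supported junctions are treated as noise
        body = genome[body_start:]
        if first_orf(body) is not None:
            kept[body_start] = genome[:leader_end] + body
    return kept


# Toy genome: leader, ORF A, spacer, ORF B, tail (coordinates are illustrative).
toy_genome = "ACGTAC" + "ATGAAATAA" + "CCC" + "ATGGGGTGA" + "TTT"
toy_support = {6: 5, 15: 1, 18: 3, 27: 4}
kept = candidate_transcripts(toy_genome, leader_end=6, junction_support=toy_support)
print(sorted(kept))
```

In this toy run the junction at index 15 is dropped for low support, and the one at index 27 is dropped because its body has no start codon. A real assembler would also need to discover the junctions from the reads rather than receive them as input.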

VirDiG is freely available at https://github.com/Limh616/VirDiG.git.

## Full-text entities

- **Species:** Gammacoronavirus (genus) [taxon 694013]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12034387/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12034387/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC12034387/full.md

---
Source: https://tomesphere.com/paper/PMC12034387