ViQUF: de novo Viral Quasispecies reconstruction using Unitig-based Flow networks
Borja Freire, Susana Ladra, Jos\'e R. Param\'a, Leena Salmela

TL;DR
ViQUF is a fast, de novo viral quasispecies assembler that reconstructs haplotypes and estimates their frequencies from short-read sequencing data using a novel flow network approach.
Contribution
It introduces a new de novo assembly method combining de Bruijn graphs with flow networks for haplotype reconstruction and frequency estimation.
Findings
At least four times faster than previous methods.
Uses half the memory of existing approaches.
Maintains or exceeds the accuracy of overlap graph-based methods.
Abstract
During viral infection, intrahost mutation and recombination can lead to significant evolution, resulting in a population of viruses that harbor multiple haplotypes. The task of reconstructing these haplotypes from short-read sequencing data is called viral quasispecies assembly, and it can be categorized as a multiassembly problem. We consider the de novo version of the problem, where no reference is available. We present ViQUF, a de novo viral quasispecies assembler that addresses haplotype assembly and quantification. ViQUF obtains a first draft of the assembly graph from a de Bruijn graph. Then, solving a min-cost flow over a flow network built for each pair of adjacent vertices based on their paired-end information creates an approximate paired assembly graph with suggested frequency values as edge labels, which is the first frequency estimation. Then, original haplotypes are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Molecular Biology Techniques and Applications · RNA and protein synthesis mechanisms
