The Canadian VirusSeq Data Portal & Duotang: open resources for SARS-CoV-2 viral sequences and genomic epidemiology
Erin E. Gill, Baofeng Jia, Carmen Lia Murall, Raphaël Poujol, Muhammad Zohaib Anwar, Nithu Sara John, Justin Richardsson, Ashley Hobb, Abayomi S. Olabode, Alexandru Lepsa, Ana T. Duggan, Andrea D. Tyler, Arnaud N'Guessan, Atul Kachru, Brandon Chan, Catherine Yoshida

TL;DR
This paper introduces the Canadian VirusSeq Data Portal and Duotang, open resources providing standardized SARS-CoV-2 genomic data and epidemiological analyses to support research, public health, and international collaboration.
Contribution
It presents a comprehensive, open-access platform and web tool for Canadian SARS-CoV-2 genomic data, with standardized pipelines and dynamic epidemiological visualizations.
Findings
Enhanced, standardized Canadian SARS-CoV-2 genomic data available
Dynamic visualization of variant trends in Canada
Open-source tools supporting public health and research
Abstract
The COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2 genomes from patient samples to track viral evolution and inform public health response. Millions of SARS-CoV-2 genome sequences have been deposited in global public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN - VirusSeq), a consortium tasked with coordinating expanded sequencing of SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian VirusSeq Data Portal, with associated data pipelines and procedures, to support these efforts. The goal of VirusSeq was to allow open access to Canadian SARS-CoV-2 genomic sequences and enhanced, standardized contextual data that were unavailable in other repositories and that meet FAIR standards (Findable, Accessible, Interoperable and Reusable). The Portal data submission pipeline contains data quality checking procedures and appropriate…
Taxonomy
Topics: Genomics and Phylogenetic Studies · Machine Learning in Bioinformatics · Genetics, Bioinformatics, and Biomedical Research
