Misdetection of frameshifts in SARS-CoV-2 genomes: need for additional harmonisation and efficient monitoring of data workflows
Rok Kogoj, Mauro Petrillo, Samo Zakotnik, Alen Suljič, Miša Korva, Gabriele Leoni

TL;DR
This paper highlights issues with misidentifying SARS-CoV-2 frameshift mutations in sequencing data and calls for better coordination to improve data workflows.
Contribution
The paper reports misidentified frameshift mutations and emphasizes the need for harmonized data workflows in SARS-CoV-2 surveillance.
Findings
Misidentified frameshifts in SARS-CoV-2 genomes can lead to incorrect assumptions about spike and nucleocapsid protein mutations.
Changes in bioinformatics pipelines or wet-lab procedures can introduce unpredictable results in sequencing data.
Improved collaboration between NGS experts, bioinformaticians, and decision-makers is needed for reliable workflows.
Abstract
Five years after the outbreak of the SARS-CoV-2 pandemic in 2020, diagnostic laboratories have moved from massive sequencing of thousands of samples to routine surveillance of SARS-CoV-2 cases, as with all other respiratory viruses. Surveillance remains of paramount importance to prevent a further SARS-CoV-2 surge, as the virus has been shown to mutate rapidly and can render available drugs and vaccines ineffective. During the pandemic, several bioinformatics pipelines and workflows have been developed to streamline analysis, shorten turnaround time and ensure reproducibility. As the number of samples decreases, laboratories are moving towards more flexible sequencing strategies and optimizing the cost per sample. However, workflow redesigns, even if individual steps have proven successful time and time again, can lead to challenges when changes in a bioinformatics pipeline are…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Genomics and Diagnostics · Single-cell and spatial transcriptomics · CRISPR and Genetic Engineering
