Comprehensive assessment of error correction methods for high-throughput sequencing data
Yun Heo, Gowthami Manikandan, Anand Ramachandran, Deming Chen

TL;DR
This paper introduces SPECTACLE, a comprehensive evaluation framework for error correction tools in high-throughput DNA and RNA sequencing, providing standardized assessment across multiple sequencing technologies.
Contribution
The study develops a novel software package and dataset collection for standardized evaluation of error correction methods on next-generation (NGS) and third-generation (TGS) sequencing data.
Findings
Evaluated 23 error correction tools across diverse datasets
Identified strengths and weaknesses of various correction methods
Provided insights to guide future development of error correction algorithms
Abstract
The advent of DNA and RNA sequencing has revolutionized the study of genomics and molecular biology. Next-generation sequencing (NGS) technologies such as Illumina, Ion Torrent, and SOLiD have made genome sequencing quick and cheap. More recently, third-generation sequencing (TGS) technologies such as PacBio and Oxford Nanopore Technology (ONT) have also been developed. Different technologies use different underlying sequencing methods and have different error rates. Though many tools exist for error correction of sequencing data from NGS and TGS methods, no standard method is yet available to evaluate the accuracy and effectiveness of these error-correction tools. In this study, we present a Software Package for Error Correction Tool Assessment on nuCLEic acid sequences (SPECTACLE) providing comprehensive algorithms to evaluate error-correction methods for…
