TL;DR
CRANE employs Hidden Markov Models to detect and correct errors in raw nanopore sequencing signals, enhancing the accuracy of downstream genomic analyses without significant computational costs.
Contribution
This work introduces CRANE, a novel HMM-based method for error correction in raw nanopore signals, improving analysis accuracy and pipeline robustness across technologies.
Findings
CRANE consistently improves raw signal analysis accuracy.
It reduces the need for pipeline optimization for new nanopore tech.
CRANE adds minimal computational overhead.
Abstract
Nanopore sequencing can read substantially longer sequences of nucleic acid molecules, called reads, than other sequencing methods, which has led to advances in genomic analysis such as the gapless human genome assembly. By analyzing the raw electrical signal reads that nanopore sequencing generates from molecules, existing works can map these reads without translating them into DNA characters (i.e., basecalling), allowing for quick and efficient analysis of sequencing data. However, raw signals often contain errors due to noise and processing errors, which limits the overall accuracy of raw signal analysis. Our goal in this work is to detect and correct errors in raw signals to improve the accuracy of raw signal analyses. To this end, we propose CRANE, a mechanism that trains and utilizes a Hidden Markov Model (HMM) to accurately correct signal errors. Our extensive evaluation on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Nanopore and Nanochannel Transport Studies · RNA and protein synthesis mechanisms
