Fast and accurate alignment of long bisulfite-seq reads
Brent S. Pedersen, Kenneth Eyring, Subhajyoti De, Ivana V. Yang and, David A. Schwartz

TL;DR
This paper compares the accuracy and efficiency of bisulfite-sequencing aligners on long reads, introducing bwa-meth, a new tool supporting complex alignments, and evaluates its performance through simulations and real data.
Contribution
It presents the first comprehensive comparison of long-read bisulfite-sequencing aligners and introduces bwa-meth, a novel aligner optimized for longer reads with complex indel support.
Findings
bwa-meth shows high accuracy on long reads
Longer reads improve alignment specificity
Benchmarking reveals best aligners for long bisulfite-seq reads
Abstract
Summary: Longer sequencing reads, with at least 200 bases per template are now common. While traditional aligners have adopted new strategies to improve the mapping of longer reads, aligners specific to bisulfite-sequencing were optimized when much shorter reads were the norm. We sought to perform the first comparison using longer reads to determine which aligners were most accurate and efficient and to evaluate a novel software tool, bwa-meth, built on a traditional mapper that supports insertions, deletions and clipped alignments. We gauge accuracy by comparing the number of on and off-target reads from a targeted sequencing project and by simulations. Availability and Implementation: The benchmarking scripts and the bwa-meth software are available at https://github/com/brentp/bwa-meth/ under the MIT License.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Advanced Proteomics Techniques and Applications · Glycosylation and Glycoproteins Research
