SlopMap: a software application tool for quick and flexible identification of similar sequences using exact k-mer matching
Ilya Y. Zhbannikov, Samuel S. Hunter, Matthew L. Settles, James A., Foster

TL;DR
SlopMap is a software tool designed for quick, flexible identification of similar sequences using exact k-mer matching, facilitating integration into bioinformatics pipelines for targeted sequence extraction from NGS data.
Contribution
It introduces SlopMap, a novel utility that enables rapid sequence similarity identification with exact k-mer matching, compatible with existing assembly workflows and preserving flowgram data.
Findings
Supports Roche 454 and Illumina data
Provides compatible output formats for assembly
Enables pipeline integration without extra programming
Abstract
With the advent of Next-Generation (NG) sequencing, it has become possible to sequence an entire genome quickly and inexpensively. However, in some experiments one only needs to extract and assembly a portion of the sequence reads, for example when performing transcriptome studies, sequencing mitochondrial genomes, or characterizing exomes. With the raw DNA-library of a complete genome it would appear to be a trivial problem to identify reads of interest. But it is not always easy to incorporate well-known tools such as BLAST, BLAT, Bowtie, and SOAP directly into a bioinformatics pipelines before the assembly stage, either due to in- compatibility with the assembler's file inputs, or because it is desirable to incorporate information that must be extracted separately. For example, in order to incorporate flowgrams from a Roche 454 sequencer into the Newbler assembler it is necessary to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Advanced Proteomics Techniques and Applications · RNA and protein synthesis mechanisms
