Alphabet-dependent Parallel Algorithm for Suffix Tree Construction for   Pattern Searching

Freeson Kaniwa; Venu Madhav Kuthadi; Otlhapile Dinakenyane; Heiko; Schroeder

arXiv:1704.05660·cs.DS·April 20, 2017

Alphabet-dependent Parallel Algorithm for Suffix Tree Construction for Pattern Searching

Freeson Kaniwa, Venu Madhav Kuthadi, Otlhapile Dinakenyane, Heiko, Schroeder

PDF

TL;DR

This paper introduces an alphabet-dependent parallel algorithm for suffix tree construction, significantly improving speed for biological sequence analysis by leveraging multicore architectures, with up to 15x acceleration over sequential methods.

Contribution

The paper presents a novel parallel suffix tree construction algorithm optimized for biological data, exploiting alphabet dependence and multicore systems for enhanced efficiency.

Findings

01

Achieved up to 15x speedup over sequential algorithms

02

Effective for large biological sequences like DNA and proteins

03

Demonstrated efficiency gains on multicore architectures

Abstract

Suffix trees have recently become very successful data structures in handling large data sequences such as DNA or Protein sequences. Consequently parallel architectures have become ubiquitous. We present a novel alphabet-dependent parallel algorithm which attempts to take advantage of the perverseness of the multicore architecture. Microsatellites are important for their biological relevance hence our algorithm is based on time efficient construction for identification of such. We experimentally achieved up to 15x speedup over the sequential algorithm on different input sizes of biological sequences.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.