On the Benefit of Merging Suffix Array Intervals for Parallel Pattern Matching
Johannes Fischer, Dominik K\"oppl, Florian Kurpicz

TL;DR
This paper introduces parallel algorithms for exact and approximate pattern matching using suffix arrays, focusing on efficient interval merging with a novel data structure, enabling faster processing on parallel architectures.
Contribution
It presents a new data structure for quickly merging suffix array intervals and parallel algorithms for pattern matching, improving efficiency over previous methods.
Findings
Parallel suffix array interval computation in sub-logarithmic time.
Efficient merging of suffix array intervals in constant or near-constant parallel time.
Algorithms handle approximate matching with up to k differences efficiently.
Abstract
We present parallel algorithms for exact and approximate pattern matching with suffix arrays, using a CREW-PRAM with processors. Given a static text of length , we first show how to compute the suffix array interval of a given pattern of length in time for . For approximate pattern matching with differences or mismatches, we show how to compute all occurrences of a given pattern in time, where is the size of the alphabet and . The workhorse of our algorithms is a data structure for merging suffix array intervals quickly: Given the suffix array intervals for two patterns and , we present a data structure for computing the interval of in sequential time, or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
