Faster Approximate String Matching for Short Patterns
Philip Bille

TL;DR
This paper introduces a faster algorithm for approximate string matching with short patterns, leveraging tabulation and word-level parallelism to improve time bounds for specific pattern lengths.
Contribution
It presents a novel implementation of the Landau-Vishkin algorithm that significantly accelerates approximate string matching for short patterns on standard hardware.
Findings
Improved time complexity for short pattern matching.
Effective use of tabulation and word-level parallelism.
Enhanced performance when pattern length is subexponential in log n.
Abstract
We study the classical approximate string matching problem, that is, given strings and and an error threshold , find all ending positions of substrings of whose edit distance to is at most . Let and have lengths and , respectively. On a standard unit-cost word RAM with word size we present an algorithm using time When is short, namely, or this improves the previously best known time bounds for the problem. The result is achieved using a novel implementation of the Landau-Vishkin algorithm based on tabulation and word-level parallelism.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Network Packet Processing and Optimization · semigroups and automata theory
