Approximate pattern matching with k-mismatches in packed text
Emanuele Giaquinta, Szymon Grabowski, Kimmo Fredriksson

TL;DR
This paper introduces new algorithms for approximate pattern matching with k-mismatches in packed text, improving efficiency in specific computational models and extending techniques to other related problems.
Contribution
The authors develop faster algorithms for k-mismatch pattern matching in packed text, optimizing performance in the $AC^0$ and word-RAM models, and introduce techniques applicable to other approximate matching problems.
Findings
Improved time complexity bounds for $k$-mismatch pattern matching in packed text.
Algorithms outperform existing bounds for $w = ext{Omega}( ext{log}^{1+ ext{epsilon}} n)$.
Extended techniques to solve other approximate matching problems.
Abstract
Given strings of length and of length over an alphabet of size , the string matching with -mismatches problem is to find the positions of all the substrings in that are at Hamming distance at most from . If can be read only one character at the time the best known bounds are and in the word-RAM model with word length . In the RAM models (including and word-RAM) it is possible to read up to characters in constant time if the characters of are encoded using bits. The only solution for -mismatches in packed text works in time, for any . We present an algorithm that runs in time $O(\frac{n}{\floor{w/(m\log\sigma)}} (1 + \log…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · semigroups and automata theory · DNA and Biological Computing
