Representing Pattern Matching Algorithms by Polynomial-Size Automata
Tobias Marschall, Noemi E. Passing

TL;DR
This paper introduces a general method to construct polynomial-size automata for various pattern matching algorithms, enabling efficient analysis of their text character access distributions.
Contribution
The authors present a unified approach to build automata of size $O(m^3)$ for multiple algorithms, improving over previous exponential size models.
Findings
Automata of size $O(m^3)$ are achievable for several algorithms.
The method enables polynomial-time computation of access distributions.
Most algorithms analyzed now have automata with polynomial size.
Abstract
Pattern matching algorithms to find exact occurrences of a pattern in a text have been analyzed extensively with respect to asymptotic best, worst, and average case runtime. For more detailed analyses, the number of text character accesses performed by an algorithm when searching a random text of length for a fixed pattern has been considered. Constructing a state space and corresponding transition rules (e.g. in a Markov chain) that reflect the behavior of a pattern matching algorithm is a key step in existing analyses of in both the asymptotic () and the non-asymptotic regime. The size of this state space is hence a crucial parameter for such analyses. In this paper, we introduce a general methodology to construct corresponding state spaces and demonstrate that it applies to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · semigroups and automata theory · DNA and Biological Computing
