On the Average-case Complexity of Pattern Matching with Wildcards
Carl Barton

TL;DR
This paper introduces new average-case algorithms for pattern matching with wildcards, providing the first non-trivial time bounds and demonstrating a complexity separation from exact pattern matching.
Contribution
It presents the first average-case complexity bounds for pattern matching with wildcards and adapts these results for wildcards in both pattern and text.
Findings
First non-trivial average-case time bounds for wildcard pattern matching.
Provable complexity separation between exact and wildcard pattern matching.
Algorithms applicable when wildcards are restricted to pattern or text.
Abstract
Pattern matching with wildcards is the problem of finding all factors of a text of length that match a pattern of length , where wildcards (characters that match everything) may be present. In this paper we present a number of fast average-case algorithms for pattern matching where wildcards are restricted to either the pattern or the text, however, the results are easily adapted to the case where wildcards are allowed in both. We analyse the \textit{average-case} complexity of these algorithms and show the first non-trivial time bounds. These are the first results on the average-case complexity of pattern matching with wildcards which, as a by product, provide with first provable separation in complexity between exact pattern matching and pattern matching with wildcards in the word RAM model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · semigroups and automata theory · Natural Language Processing Techniques
