On the Decision Tree Complexity of String Matching
Xiaoyu He, Neng Huang, Xiaoming Sun

TL;DR
This paper investigates the decision tree complexity of string matching, providing a near-complete algorithmic solution, new conditions for pattern evasiveness, and connecting the problem to Skolem's Problem in mathematics.
Contribution
It introduces a nearly complete query algorithm for string matching complexity, surpasses previous bounds, and offers new algebraic conditions for pattern evasiveness.
Findings
Most binary patterns are evasive, requiring querying all characters.
The proposed algorithm nearly determines the decision tree complexity for all patterns.
A novel algebraic condition for pattern evasiveness is established.
Abstract
String matching is one of the most fundamental problems in computer science. A natural problem is to determine the number of characters that need to be queried (i.e. the decision tree complexity) in a string in order to decide whether this string contains a certain pattern. Rivest showed that for every pattern , in the worst case any deterministic algorithm needs to query at least characters, where is the length of the string and is the length of the pattern. He further conjectured that this bound is tight. By using the adversary method, Tuza disproved this conjecture and showed that more than one half of binary patterns are {\em evasive}, i.e. any algorithm needs to query all the characters (see Section 1.1 for more details). In this paper, we give a query algorithm which settles the decision tree complexity of string matching except for a negligible fraction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
