Matching Patterns with Variables under Hamming Distance
Pawe{\l} Gawrychowski, Florin Manea, Stefan Siemer

TL;DR
This paper investigates approximate pattern matching with variables under Hamming distance, providing efficient algorithms for certain classes of patterns and establishing intractability results for more complex cases.
Contribution
It introduces algorithms for approximate matching of variable patterns with Hamming distance and characterizes the complexity boundaries based on pattern structure.
Findings
Efficient algorithms for regular patterns with no repeated variables.
Matching algorithms for patterns with restricted variable interleaving.
Intractability results for patterns with arbitrary variable interleaving.
Abstract
A pattern is a string of variables and terminal letters. We say that matches a word , consisting only of terminal letters, if can be obtained by replacing the variables of by terminal words. The matching problem, i.e., deciding whether a given pattern matches a given word, was heavily investigated: it is NP-complete in general, but can be solved efficiently for classes of patterns with restricted structure. In this paper, we approach this problem in a generalized setting, by considering approximate pattern matching under Hamming distance. More precisely, we are interested in what is the minimum Hamming distance between and any word obtained by replacing the variables of by terminal words. Firstly, we address the class of regular patterns (in which no variable occurs twice) and propose efficient algorithms for this problem, as well as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
