Matching Patterns with Variables under Hamming Distance

Pawe{\l} Gawrychowski; Florin Manea; Stefan Siemer

arXiv:2106.06249·cs.DS·June 14, 2021

Matching Patterns with Variables under Hamming Distance

Pawe{\l} Gawrychowski, Florin Manea, Stefan Siemer

PDF

TL;DR

This paper investigates approximate pattern matching with variables under Hamming distance, providing efficient algorithms for certain classes of patterns and establishing intractability results for more complex cases.

Contribution

It introduces algorithms for approximate matching of variable patterns with Hamming distance and characterizes the complexity boundaries based on pattern structure.

Findings

01

Efficient algorithms for regular patterns with no repeated variables.

02

Matching algorithms for patterns with restricted variable interleaving.

03

Intractability results for patterns with arbitrary variable interleaving.

Abstract

A pattern $α$ is a string of variables and terminal letters. We say that $α$ matches a word $w$ , consisting only of terminal letters, if $w$ can be obtained by replacing the variables of $α$ by terminal words. The matching problem, i.e., deciding whether a given pattern matches a given word, was heavily investigated: it is NP-complete in general, but can be solved efficiently for classes of patterns with restricted structure. In this paper, we approach this problem in a generalized setting, by considering approximate pattern matching under Hamming distance. More precisely, we are interested in what is the minimum Hamming distance between $w$ and any word $u$ obtained by replacing the variables of $α$ by terminal words. Firstly, we address the class of regular patterns (in which no variable occurs twice) and propose efficient algorithms for this problem, as well as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.