Pattern Matching under Weighted Edit Distance
Panagiotis Charalampopoulos, Tomasz Kociumaka, Philip Wellnitz

TL;DR
This paper introduces new algorithms for pattern matching under weighted edit distance, improving efficiency and applicability in real-world scenarios with variable edit costs.
Contribution
It presents three novel algorithms for PMWED with different assumptions on weight functions, including a simple $ ilde{O}(nk)$-time solution and more complex algorithms for metric and arbitrary weights.
Findings
A simple $ ilde{O}(nk)$-time algorithm for PMWED.
A more complex $ ilde{O}(n + k^{3.5} W^4 n/m)$-time algorithm for metric weights.
An $ ilde{O}(n + k^4 n/m)$-time algorithm for arbitrary weights.
Abstract
In Pattern Matching with Weighted Edits (PMWED), we are given a pattern of length , a text of length , a positive threshold , and oracle access to a weight function that specifies the costs of edits (depending on the involved characters, and normalized so that the cost of each edit is at least ). The goal is to compute the starting positions of all fragments of that can be obtained from with edits of total cost at most . PMWED captures typical real-world applications more accurately than its unweighted variant (PMED), where all edits have unit costs. We obtain three main results: (a) a conceptually simple -time algorithm for PMWED, very different from that of Landau and Vishkin for PMED; (b) a significantly more complicated -time algorithm for PMWED under the assumption that the weight function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Graph Theory and Algorithms · Natural Language Processing Techniques
