TL;DR
This paper introduces output-sensitive parallel algorithms for computing edit distance, optimizing performance for similar strings with small edit distances, and compares their theoretical guarantees and practical performance.
Contribution
It presents four novel parallel algorithms for edit distance, including BFS-based and divide-and-conquer approaches, with new data structures for LCP queries and comprehensive empirical evaluation.
Findings
BFS-based algorithms outperform existing implementations in tests.
Different algorithms suit different input patterns, guiding their selection.
The study provides the first systematic analysis of parallel edit distance algorithms.
Abstract
Given two strings and , and a set of operations allowed to edit the strings, the edit distance between and is the minimum number of operations required to transform into . Sequentially, a standard Dynamic Programming (DP) algorithm solves edit distance with cost. In many real-world applications, the strings to be compared are similar and have small edit distances. To achieve highly practical implementations, we focus on output-sensitive parallel edit-distance algorithms, i.e., to achieve asymptotically better cost bounds than the standard algorithm when the edit distance is small. We study four algorithms in the paper, including three algorithms based on Breadth-First Search (BFS) and one algorithm based on Divide-and-Conquer (DaC). Our BFS-based solution is based on the Landau-Vishkin algorithm. We implement three different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
