Many Flavors of Edit Distance

Sudatta Bhattacharya; Sanjana Dey; Elazar Goldenberg; Michal Kouck\'y

arXiv:2410.09877·cs.DS·October 15, 2024

Many Flavors of Edit Distance

Sudatta Bhattacharya, Sanjana Dey, Elazar Goldenberg, Michal Kouck\'y

PDF

TL;DR

This paper explores the relationships between different string similarity measures, demonstrating how to reduce complex problems over large alphabets to simpler binary cases and transforming indel distance questions into edit distance ones.

Contribution

It provides methods to reduce string similarity problems over arbitrary alphabets to binary alphabets and transforms indel distance questions into edit distance questions.

Findings

01

Reduction of string similarity problems to binary alphabet cases.

02

Transformation of indel distance questions into edit distance questions.

03

Complementing earlier results by Tiskin (2007).

Abstract

Several measures exist for string similarity, including notable ones like the edit distance and the indel distance. The former measures the count of insertions, deletions, and substitutions required to transform one string into another, while the latter specifically quantifies the number of insertions and deletions. Many algorithmic solutions explicitly address one of these measures, and frequently techniques applicable to one can also be adapted to work with the other. In this paper, we investigate whether there exists a standardized approach for applying results from one setting to another. Specifically, we demonstrate the capability to reduce questions regarding string similarity over arbitrary alphabets to equivalent questions over a binary alphabet. Furthermore, we illustrate how to transform questions concerning indel distance into equivalent questions based on edit distance. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.