Adaptive Computation of the Swap-Insert Correction Distance
J\'er\'emy Barbay, Pablo P\'erez-Lantero

TL;DR
This paper introduces an algorithm to compute the NP-hard swap-insert correction distance between strings, with complexity depending on string length, alphabet size, and character distribution, showing practical cases are often easier.
Contribution
The paper presents a novel algorithm for calculating the swap-insert correction distance with complexity influenced by string and alphabet properties, addressing the NP-hardness challenge.
Findings
Algorithm computes distance within $O(d^2 nm g^{d-1})$ time.
Difficulty measure $g$ bounds the problem's complexity.
Many real-world cases are computationally easier than the worst case.
Abstract
The Swap-Insert Correction distance from a string of length to another string of length on the alphabet is the minimum number of insertions, and swaps of pairs of adjacent symbols, converting into . Contrarily to other correction distances, computing it is NP-Hard in the size of the alphabet. We describe an algorithm computing this distance in time within , where there are occurrences of in , occurrences of in , and where measures the difficulty of the instance. The difficulty is bounded by above by various terms, such as the length of the shortest string , and by the maximum number of occurrences of a single character in . Those results illustrate how, in many cases, the correction distance between two strings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
