Adaptive Computation of the Swap-Insert Correction Distance

J\'er\'emy Barbay; Pablo P\'erez-Lantero

arXiv:1504.07298·cs.DS·June 30, 2015

Adaptive Computation of the Swap-Insert Correction Distance

J\'er\'emy Barbay, Pablo P\'erez-Lantero

PDF

TL;DR

This paper introduces an algorithm to compute the NP-hard swap-insert correction distance between strings, with complexity depending on string length, alphabet size, and character distribution, showing practical cases are often easier.

Contribution

The paper presents a novel algorithm for calculating the swap-insert correction distance with complexity influenced by string and alphabet properties, addressing the NP-hardness challenge.

Findings

01

Algorithm computes distance within $O(d^2 nm g^{d-1})$ time.

02

Difficulty measure $g$ bounds the problem's complexity.

03

Many real-world cases are computationally easier than the worst case.

Abstract

The Swap-Insert Correction distance from a string $S$ of length $n$ to another string $L$ of length $m \geq n$ on the alphabet $[1.. d]$ is the minimum number of insertions, and swaps of pairs of adjacent symbols, converting $S$ into $L$ . Contrarily to other correction distances, computing it is NP-Hard in the size $d$ of the alphabet. We describe an algorithm computing this distance in time within $O (d^{2} nm g^{d - 1})$ , where there are $n_{α}$ occurrences of $α$ in $S$ , $m_{α}$ occurrences of $α$ in $L$ , and where $g = max_{α \in [1.. d]} min {n_{α}, m_{α} - n_{α}}$ measures the difficulty of the instance. The difficulty $g$ is bounded by above by various terms, such as the length of the shortest string $S$ , and by the maximum number of occurrences of a single character in $S$ . Those results illustrate how, in many cases, the correction distance between two strings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.