Approximating the Median under the Ulam Metric

Diptarka Chakraborty; Debarati Das; Robert Krauthgamer

arXiv:2011.00868·cs.DS·November 3, 2020

Approximating the Median under the Ulam Metric

Diptarka Chakraborty, Debarati Das, Robert Krauthgamer

PDF

TL;DR

This paper develops new approximation algorithms for the median string problem under the Ulam metric, surpassing the known 2-approximation barrier and addressing both worst-case and probabilistic models.

Contribution

It introduces algorithms that achieve better than 2-approximation for the median under the Ulam metric, including in specific cases and probabilistic models.

Findings

01

Breaks the 2-approximation barrier with a (2-δ)-approximate algorithm.

02

Provides a (2-δ)-approximation for median strings with large objective value.

03

Designs a high-probability (1+o(1/ε))-approximate median algorithm for perturbed permutations.

Abstract

We study approximation algorithms for variants of the \emph{median string} problem, which asks for a string that minimizes the sum of edit distances from a given set of $m$ strings of length $n$ . Only the straightforward $2$ -approximation is known for this NP-hard problem. This problem is motivated e.g.~by computational biology, and belongs to the class of median problems (over different metric spaces), which are fundamental tasks in data analysis. Our main result is for the Ulam metric, where all strings are permutations over $[n]$ and each edit operation moves a symbol (deletion plus insertion). We devise for this problem an algorithms that breaks the $2$ -approximation barrier, i.e., computes a $(2 - δ)$ -approximate median permutation for some constant $δ > 0$ in time $\tilde{O} (n m^{2} + n^{3})$ . We further use these techniques to achieve a $(2 - δ)$ approximation for the median…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.