Approximating the Median under the Ulam Metric
Diptarka Chakraborty, Debarati Das, Robert Krauthgamer

TL;DR
This paper develops new approximation algorithms for the median string problem under the Ulam metric, surpassing the known 2-approximation barrier and addressing both worst-case and probabilistic models.
Contribution
It introduces algorithms that achieve better than 2-approximation for the median under the Ulam metric, including in specific cases and probabilistic models.
Findings
Breaks the 2-approximation barrier with a (2-δ)-approximate algorithm.
Provides a (2-δ)-approximation for median strings with large objective value.
Designs a high-probability (1+o(1/ε))-approximate median algorithm for perturbed permutations.
Abstract
We study approximation algorithms for variants of the \emph{median string} problem, which asks for a string that minimizes the sum of edit distances from a given set of strings of length . Only the straightforward -approximation is known for this NP-hard problem. This problem is motivated e.g.~by computational biology, and belongs to the class of median problems (over different metric spaces), which are fundamental tasks in data analysis. Our main result is for the Ulam metric, where all strings are permutations over and each edit operation moves a symbol (deletion plus insertion). We devise for this problem an algorithms that breaks the -approximation barrier, i.e., computes a -approximate median permutation for some constant in time . We further use these techniques to achieve a approximation for the median…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
