Approximating binary longest common subsequence in almost-linear time

Xiaoyu He; Ray Li

arXiv:2211.16660·cs.DS·February 13, 2023

Approximating binary longest common subsequence in almost-linear time

Xiaoyu He, Ray Li

PDF

Open Access

TL;DR

This paper presents a near-linear time algorithm that improves the approximation ratio for binary longest common subsequence, surpassing the simple 1/2-approximation, and extends the result to strings over larger alphabets.

Contribution

It introduces a new approximation algorithm for binary LCS with ratio better than 1/2 in almost-linear time, generalizing previous results to unequal length strings.

Findings

01

Achieves a (rac{1}{2}+ ext{delta})-approximation in n^{1+epsilon} time for binary LCS.

02

Extends approximation results to q-ary strings in near-linear time.

03

Develops a new combinatorial structure lemma for strings based on oscillation patterns.

Abstract

The Longest Common Subsequence (LCS) is a fundamental string similarity measure, and computing the LCS of two strings is a classic algorithms question. A textbook dynamic programming algorithm gives an exact algorithm in quadratic time, and this is essentially best possible under plausible fine-grained complexity assumptions, so a natural problem is to find faster approximation algorithms. When the inputs are two binary strings, there is a simple $\frac{1}{2}$ -approximation in linear time: compute the longest common all-0s or all-1s subsequence. It has been open whether a better approximation is possible even in truly subquadratic time. Rubinstein and Song showed that the answer is yes under the assumption that the two input strings have equal lengths. We settle the question, generalizing their result to unequal length strings, proving that, for any $ε > 0$ , there exists…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · DNA and Biological Computing · semigroups and automata theory