Finding Diverse Strings and Longest Common Subsequences in a Graph
Yuto Shida, Giulia Punzi, Yasuaki Kobayashi, Takeaki Uno, and Hiroki, Arimura

TL;DR
This paper investigates the computational complexity of finding diverse longest common subsequences in a set of strings under Hamming distance, introducing new algorithms and complexity results for both bounded and unbounded cases.
Contribution
It is the first to analyze the Diverse LCS problem under Hamming distance, providing polynomial algorithms for bounded cases and NP-hardness with approximation schemes for unbounded cases.
Findings
Polynomial-time solutions for bounded K
NP-hardness for unbounded K
PTAS available for Max-Sum Diverse LCSs
Abstract
In this paper, we study for the first time the Diverse Longest Common Subsequences (LCSs) problem under Hamming distance. Given a set of a constant number of input strings, the problem asks to decide if there exists some subset of longest common subsequences whose diversity is no less than a specified threshold , where we consider two types of diversities of a set of strings of equal length: the Sum diversity and the Min diversity defined as the sum and the minimum of the pairwise Hamming distance between any two strings in , respectively. We analyze the computational complexity of the respective problems with Sum- and Min-diversity measures, called the Max-Sum and Max-Min Diverse LCSs, respectively, considering both approximation algorithms and parameterized complexity. Our results are summarized as follows. When is bounded, both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
