Polynomial-Time Solutions for Longest Common Subsequence Related Problems Between a Sequence and a Pangenome Graph
Xingfu Li, Yongping Wang

TL;DR
This paper introduces polynomial-time algorithms for solving the Longest Common Subsequence and related problems between a sequence and a pangenome graph, advancing genome analysis methods.
Contribution
It provides four polynomial-time reductions transforming LCS variants into longest path problems in DAGs, proving their solvability in polynomial time.
Findings
All four LCS-related problems are in P.
Reductions to longest path in DAGs enable efficient solutions.
Enhances computational tools for pangenome analysis.
Abstract
A pangenome captures the genetic diversity across multiple individuals simultaneously, providing a more comprehensive reference for genome analysis than a single linear genome, which may introduce allele bias. A widely adopted pangenome representation is a node-labeled directed graph, wherein the paths correspond to plausible genomic sequences within a species. Consequently, evaluating sequence-to-pangenome graph similarity constitutes a fundamental task in pangenome construction and analysis. This study explores the Longest Common Subsequence (LCS) problem and three of its variants involving a sequence and a pangenome graph. We present four polynomial-time reductions that transform these LCS-related problems into the longest path problem in a directed acyclic graph (DAG). These reductions demonstrate that all four problems can be solved in polynomial time, establishing their membership…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenome Rearrangement Algorithms · Genomics and Phylogenetic Studies · Genetic Associations and Epidemiology
