Chaining of Maximal Exact Matches in Graphs
Nicola Rizzo, Manuel C\'aceres, Veli M\"akinen

TL;DR
This paper introduces a novel method for chaining maximal exact matches in directed acyclic graphs to efficiently solve the longest common subsequence problem, with a new symmetric formulation and specific time complexity.
Contribution
It presents a new symmetric chaining formulation in DAGs and an algorithm that solves the LCS problem between a query and a graph efficiently.
Findings
Achieves $O(m+n+k^2|V| + |E| + kN ext{log} N)$ time complexity.
Provides a method to encode full MEMs within the chaining process.
Introduces a symmetric formulation of chaining in DAGs.
Abstract
We show how to chain maximal exact matches (MEMs) between a query string and a labeled directed acyclic graph (DAG) to solve the longest common subsequence (LCS) problem between and . We obtain our result via a new symmetric formulation of chaining in DAGs that we solve in time, where , is the total length of node labels, is the minimum number of paths covering the nodes of and is the number of MEMs between and node labels, which we show encode full MEMs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Graph Theory Research · DNA and Biological Computing
