Efficient algorithms for enumerating maximal common subsequences of two   strings

Miyuji Hirota; Yoshifumi Sakai

arXiv:2307.10552·cs.DS·July 21, 2023·2 cites

Efficient algorithms for enumerating maximal common subsequences of two strings

Miyuji Hirota, Yoshifumi Sakai

PDF

Open Access

TL;DR

This paper introduces multiple efficient algorithms for enumerating all maximal common subsequences of two strings, optimizing preprocessing time, space, and delay for practical applications.

Contribution

It presents three novel algorithms with different space-time trade-offs for enumerating MCSs, including data structures for targeted search without full enumeration.

Findings

01

Cubic-space algorithm preprocesses in cubic time, outputs MCSs in linear time.

02

Quadratic-space algorithm preprocesses in quadratic time, outputs MCSs in linear time.

03

Linear-space algorithm preprocesses in quadratic time, outputs MCSs in linearithmic time.

Abstract

We propose efficient algorithms for enumerating maximal common subsequences (MCSs) of two strings. Efficiency of the algorithms are estimated by the preprocessing-time, space, and delay-time complexities. One algorithm prepares a cubic-space data structure in cubic time to output each MCS in linear time. This data structure can be used to search for particular MCSs satisfying some condition without performing an explicit enumeration. Another prepares a quadratic-space data structure in quadratic time to output each MCS in linear time, and the other prepares a linear-space data structure in quadratic time to output each MCS in linearithmic time.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · Genome Rearrangement Algorithms · DNA and Biological Computing