Fast Arc-Annotated Subsequence Matching in Linear Space
Philip Bille, Inge Li Goertz

TL;DR
This paper introduces a linear-space algorithm for arc-preserving subsequence matching in nested arc-annotated strings, crucial for analyzing RNA structures, improving space efficiency while maintaining the same time complexity as previous methods.
Contribution
The paper presents a new algorithm that reduces space complexity from quadratic to linear for arc-preserving subsequence matching in nested arc-annotated strings.
Findings
Achieves $O(nm)$ time complexity with $O(n + m)$ space complexity.
Enables processing of large RNA molecules more efficiently.
Introduces novel techniques applicable to arc-annotated string problems.
Abstract
An arc-annotated string is a string of characters, called bases, augmented with a set of pairs, called arcs, each connecting two bases. Given arc-annotated strings and the arc-preserving subsequence problem is to determine if can be obtained from by deleting bases from . Whenever a base is deleted any arc with an endpoint in that base is also deleted. Arc-annotated strings where the arcs are ``nested'' are a natural model of RNA molecules that captures both the primary and secondary structure of these. The arc-preserving subsequence problem for nested arc-annotated strings is basic primitive for investigating the function of RNA molecules. Gramm et al. [ACM Trans. Algorithms 2006] gave an algorithm for this problem using time and space, where and are the lengths of and , respectively. In this paper we present a new algorithm using time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · Genomics and Phylogenetic Studies · RNA modifications and cancer
