Asymptotics of Canonical and Saturated RNA Secondary Structures
Peter Clote, Evangelos Kranakis, Danny Krizanc, Bruno Salvy (INRIA, Rocquencourt)

TL;DR
This paper analyzes the asymptotic combinatorial properties of canonical and saturated RNA secondary structures, explaining computational speed-ups and providing new asymptotic counts and distributions for these subclasses.
Contribution
It introduces the asymptotic enumeration of saturated structures, explains the computational speed-up for canonical structures, and presents a stochastic sampling method for saturated structures.
Findings
Asymptotic number of saturated structures is computed.
Expected base pairs in saturated structures is approximately 0.337361 n.
Number of saturated stem-loop structures grows as 0.323954 1.69562^n.
Abstract
It is a classical result of Stein and Waterman that the asymptotic number of RNA secondary structures is . In this paper, we study combinatorial asymptotics for two special subclasses of RNA secondary structures - canonical and saturated structures. Canonical secondary structures were introduced by Bompf\"unewerer et al., who noted that the run time of Vienna RNA Package is substantially reduced when restricting computations to canonical structures. Here we provide an explanation for the speed-up. Saturated secondary structures have the property that no base pairs can be added without violating the definition of secondary structure (i.e. introducing a pseudoknot or base triple). Here we compute the asymptotic number of saturated structures, we show that the asymptotic expected number of base pairs is , and the asymptotic number of saturated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
