Valid plane trees: Combinatorial models for RNA secondary structures with Watson-Crick base pairs
Frances Black, Elizabeth Drellich, and Julianna Tymoczko

TL;DR
This paper models RNA secondary structures using augmented plane trees constrained by Watson-Crick base pairing, proving connectivity of the structure graph, providing an algorithm for tree construction, and analyzing enumeration and probability properties.
Contribution
It introduces a new combinatorial model for RNA secondary structures with constraints, along with algorithms and enumeration results, advancing understanding of RNA folding complexity.
Findings
The graph of valid plane trees is connected.
An explicit algorithm constructs valid plane trees from sequences.
The proportion of sequences with valid structures approaches zero as length increases.
Abstract
The combinatorics of RNA plays a central role in biology. Mathematical biologists have several commonly-used models for RNA: words in a fixed alphabet (representing the primary sequence of nucleotides) and plane trees (representing the secondary structure, or folding of the RNA sequence). This paper considers an augmented version of the standard model of plane trees, one that incorporates some observed constraints on how the folding can occur. In particular we assume the alphabet consists of complementary pairs, for instance the Watson-Crick pairs A-U and C-G of RNA. Given a word in the alphabet, a valid plane tree is a tree for which, when the word is folded around the tree, each edge matches two complementary letters. Consider the graph whose vertices are valid plane trees for a fixed word and whose edges are given by Condon, Heitsch, and Hoos's local moves. We prove this graph is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
