Large deviation principles and evolutionary multiple structure alignment of non-coding RNA
Brandon Legried

TL;DR
This paper introduces a probabilistic analysis of the TKF91 Structure Tree for non-coding RNA, establishing large deviation principles and proposing a new alignment method for low-identity sequences in dense phylogenies.
Contribution
It provides a novel probabilistic framework with large deviation principles for RNA structure modeling and a new alignment procedure for challenging sequence comparisons.
Findings
Proved large deviation principles for stem, helix, and tree lengths.
Developed a new alignment method for low-identity sequences.
Enhanced understanding of dynamic folding patterns in non-coding RNA.
Abstract
Non-coding RNA are functional molecules that are not translated into proteins. Their function comes as important regulators of biological function. Because they are not translated, they need not be as stable as other types of RNA. The TKF91 Structure Tree from Holmes 2004 is a probability model that effectively describes correlated substitution, insertion, and deletion of base pairs, and found to have some worth in understanding dynamic folding patterns. In this paper, we provide a new probabilistic analysis of the TKF91 Structure Tree. Large deviation principles on stem lengths, helix lengths, and tree size are proved. Additionally, we give a new alignment procedure that constructs accurate sequence and structural alignments for sequences with low identity for a dense enough phylogeny.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms
MethodsBalanced Selection
