Deconvolving RNA Base Pairing Signals
Torin Greenwood, Christine E. Heitsch

TL;DR
This paper investigates the challenge of accurately recovering the relative weightings of multiple stable RNA structures from sequence data, quantifying the difficulty and limitations through theoretical analysis and information measures.
Contribution
It provides a theoretical framework for understanding the limitations of current algorithms in reconstructing RNA conformational ratios, highlighting fundamental constraints.
Findings
Many RNA structure pairs cannot be jointly reconstructed with low total variation distance.
The amount of auxiliary data influences the uncertainty in predicting conformational ratios.
Theoretical bounds are established under the Nussinov-Jacobson model.
Abstract
A growing number of RNA sequences are now known to have distributions of multiple stable sequences. Recent algorithms use the list of nucleotides in a sequence and auxiliary experimental data to predict such distributions. Although the algorithms are largely successful in identifying a distribution's constituent structures, it remains challenging to recover their relative weightings. In this paper, we quantify this issue using a total variation distance. Then, we prove under a Nussinov-Jacobson model that a large proportion of RNA structure pairs cannot be jointly reconstructed with low total variation distance. Finally, we characterize the uncertainty in predicting conformational ratios by analyzing the amount of information in the auxiliary data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · Fractal and DNA sequence analysis · Genomics and Phylogenetic Studies
