Random words in free groups, non-crossing matchings and RNA secondary structures

Siddhartha Gadgil; Manjunath Krishnapur

arXiv:2007.12109 · math.GR · January 20, 2022

Open Access

TL;DR

This paper investigates the expected fraction of unpaired bases in random RNA sequences and relates it to properties of words in free groups, establishing convergence to a constant and bounds for it.

Contribution

It establishes the convergence of the unpaired base fraction in random RNA structures and connects this to free group word length ratios, extending results to all non-abelian free groups.

Findings

01

The expected fraction of unpaired bases converges to a constant $λ_{2}$ with $0 < λ_{2} < 1$.

02

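For a random word, the ratio of its shortest length in the generating set of conjugates of generators to its length in the standard generators converges to a constant, so the former grows linearly with the latter.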

03

Results hold for all non-abelian finitely generated free groups.

Abstract

Consider a random word $X^{n} = (X_{1}, \dots, X_{n})$ in an alphabet consisting of $4$ letters, with the letters viewed either as $A$, $U$, $G$ and $C$ (i.e., nucleotides in an RNA sequence) or $α$, $\bar{α}$, $β$ and $\bar{β}$ (i.e., generators of the free group $⟨α, β⟩$ and their inverses). We show that the expected fraction $ρ(n)$ of unpaired bases in an optimal RNA secondary structure (with only Watson-Crick bonds and no pseudo-knots) converges to a constant $λ_{2}$ with $0 < λ_{2} < 1$ as $n \to \infty$. Thus, a positive proportion of the bases of a random RNA string do not form hydrogen bonds. We do not know the exact value of $λ_{2}$, but we derive upper and lower bounds for it. In terms of free groups, $ρ(n)$ is the ratio of the length of the shortest word representing $X$ in the generating set consisting of conjugates of generators…
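The quantity $ρ(n)$ in the abstract can be explored numerically. The following is a minimal sketch, not the authors' method: it assumes that an optimal secondary structure is a maximum non-crossing matching in which only A-U and G-C may pair, with no minimum loop length, and it computes that matching with a standard Nussinov-style dynamic program. The function names `min_unpaired` and `estimate_rho` are hypothetical and introduced here for illustration only.

```python
import random

# Watson-Crick complements only (A-U, G-C); wobble pairs are not allowed.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def min_unpaired(seq):
    """Fewest unpaired bases over all non-crossing Watson-Crick matchings.

    Assumption: any two complementary bases may pair, including adjacent
    ones (no minimum hairpin loop length is enforced).
    """
    n = len(seq)
    # best[i][j] = maximum number of pairs within seq[i:j] (half-open range).
    best = [[0] * (n + 1) for _ in range(n + 1)]
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length
            # Option 1: base i stays unpaired.
            m = best[i + 1][j]
            # Option 2: base i pairs with base k; the pair splits the
            # interval into an inside part and an outside part, which
            # rules out crossings (pseudo-knots).
            for k in range(i + 1, j):
                if COMPLEMENT[seq[i]] == seq[k]:
                    m = max(m, 1 + best[i + 1][k] + best[k + 1][j])
            best[i][j] = m
    return n - 2 * best[0][n]


def estimate_rho(n, trials, seed=0):
    """Monte Carlo estimate of the expected unpaired fraction at length n."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        seq = "".join(rng.choice("AUGC") for _ in range(n))
        total += min_unpaired(seq)
    return total / (trials * n)
```

Calling `estimate_rho(n, trials)` for increasing `n` gives a rough empirical view of the convergence the abstract describes. The dynamic program takes cubic time in `n`, so it is only practical for sequences of moderate length.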

Taxonomy

Topics: RNA and protein synthesis mechanisms · RNA Research and Splicing · Genomics and Chromatin Dynamics