Efficient design of rna sequences with desired properties, structure, and motifs using a grammar variational autoencoder
Narges Zarnaghinaghsh, Byung-Jun Yoon

TL;DR
This paper introduces RGVAE, a novel variational autoencoder that uses stochastic context-free grammar to efficiently generate RNA sequences with specific structural and motif properties, outperforming previous methods.
Contribution
The paper presents RGVAE, a grammar-based VAE that ensures thermodynamically stable RNA design with targeted features, advancing computational RNA engineering.
Findings
RGVAE effectively generates stable RNA sequences with desired motifs.
It outperforms randomized and standard VAEs in design accuracy.
The method enables efficient exploration of the RNA design space.
Abstract
Designing structurally stable RNA sequences with specific motifs and other desirable properties is an important challenge in bioinformatics. The potential design space increases exponentially with the length of the RNA to be engineered, which makes this a difficult combinatorial optimization problem. In this paper, we propose an RNA grammar variational autoencoder (RGVAE) that can efficiently generate novel RNA sequences with specific target properties. The proposed RGVAE builds on the recently proposed grammar VAE, where we incorporate the stochastic context-free grammar (SCFG) to design strutural RNAs with desired motifs and characteristics. Using the SCFG can ensure that the generated RNA sequence can form a thermodynamically stable secondary structure. Given a RNA sequence, the SCFT is used to find the parse tree, which is represented in a continuous low-dimensional latent space by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer-related molecular mechanisms research
