TL;DR
RNA-FrameFlow is a novel generative model that designs 3D RNA backbones by adapting flow matching techniques from protein modeling, addressing RNA-specific challenges and evaluating structural validity.
Contribution
It introduces the first generative model for 3D RNA backbone design using SE(3) flow matching, with new protocols for data preparation and evaluation tailored to RNA.
Findings
Generates realistic RNA backbones of 40-150 nucleotides
Over 40% of generated RNAs pass validity criteria based on self-consistency
Effective data augmentation improves model diversity
Abstract
We introduce RNA-FrameFlow, the first generative model for 3D RNA backbone design. We build upon SE(3) flow matching for protein backbone generation and establish protocols for data preparation and evaluation to address unique challenges posed by RNA modeling. We formulate RNA structures as a set of rigid-body frames and associated loss functions which account for larger, more conformationally flexible RNA backbones (13 atoms per nucleotide) vs. proteins (4 atoms per residue). Toward tackling the lack of diversity in 3D RNA datasets, we explore training with structural clustering and cropping augmentations. Additionally, we define a suite of evaluation metrics to measure whether the generated RNA structures are globally self-consistent (via inverse folding followed by forward folding) and locally recover RNA-specific structural descriptors. The most performant version of RNA-FrameFlow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
