RDesign: Hierarchical Data-efficient Representation Learning for Tertiary Structure-based RNA Design
Cheng Tan, Yijie Zhang, Zhangyang Gao, Bozhen Hu, Siyuan Li, Zicheng Liu, Stan Z. Li

TL;DR
This paper introduces RDesign, a hierarchical, data-efficient learning framework for RNA tertiary structure-based design, leveraging contrastive learning and secondary structure priors to improve design accuracy with limited data.
Contribution
The study presents a novel hierarchical contrastive learning approach for RNA design, incorporating secondary structure priors and a new benchmark dataset, addressing data scarcity and structural complexity.
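To make the contrastive-learning idea concrete, here is a minimal sketch of a generic InfoNCE-style contrastive loss in plain Python. It is an illustration only: this summary does not describe the paper's hierarchical scheme in detail, so the function name `info_nce_loss`, the `temperature` value, and the toy embeddings are all assumptions rather than the authors' implementation. The loss pulls each anchor embedding toward its paired positive and pushes it away from the other samples in the batch.

```python
import math


def _normalize(vec):
    """Scale a vector to unit length (assumes the vector is non-zero)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def info_nce_loss(anchors, positives, temperature=0.1):
    """Generic InfoNCE loss over a batch of paired embeddings.

    anchors[i] and positives[i] are treated as a matching pair; every
    positives[j] with j != i acts as a negative for anchors[i].
    The temperature of 0.1 is an illustrative choice, not a value from the paper.
    """
    a = [_normalize(v) for v in anchors]
    p = [_normalize(v) for v in positives]
    total = 0.0
    for i, a_i in enumerate(a):
        # Cosine similarity of anchor i with every candidate, scaled by temperature.
        logits = [sum(x * y for x, y in zip(a_i, p_j)) / temperature for p_j in p]
        # Numerically stable log-sum-exp for the softmax denominator.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        # Negative log-probability assigned to the true pair.
        total += log_denom - logits[i]
    return total / len(a)


# Toy embeddings (hypothetical): two samples in a 2-D embedding space.
anchors = [[1.0, 0.0], [0.0, 1.0]]
matched = [[0.9, 0.1], [0.1, 0.9]]      # each positive is close to its anchor
mismatched = [[0.1, 0.9], [0.9, 0.1]]   # pairs swapped

loss_matched = info_nce_loss(anchors, matched)
loss_mismatched = info_nce_loss(anchors, mismatched)
```

With correctly paired embeddings the loss is lower than with swapped pairs, which is the signal a contrastive objective uses to shape the representation space when labeled data is limited.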
Findings
Effective in leveraging limited data for RNA design
Outperforms existing methods in accuracy and reliability
Provides a new benchmark dataset for future research
Abstract
While artificial intelligence has made remarkable strides in revealing the relationship between biological macromolecules' primary sequence and tertiary structure, designing RNA sequences based on specified tertiary structures remains challenging. Though existing approaches in protein design have thoroughly explored structure-to-sequence dependencies in proteins, RNA design still confronts difficulties due to structural complexity and data scarcity. Moreover, direct transplantation of protein design methodologies into RNA design fails to achieve satisfactory outcomes although sharing similar structural components. In this study, we aim to systematically construct a data-driven RNA design pipeline. We crafted a large, well-curated benchmark dataset and designed a comprehensive structural modeling approach to represent the complex RNA tertiary structure. More importantly, we proposed a…
Taxonomy
Topics: RNA and protein synthesis mechanisms · Genomics and Chromatin Dynamics · Protein Structure and Dynamics
Methods: Contrastive Learning · Balanced Selection
