ZeroFold: Protein-RNA Binding Affinity Predictions from Pre-Structural Embeddings
Josef Hanke (1), Sebastian Pujalte Ojeda (1), Shengyu Zhang (1), Werngard Czechtizky (2), Leonardo De Maria (2), Michele Vendruscolo (1) ((1) Yusuf Hamied Department of Chemistry, University of Cambridge, UK (2) Medicinal Chemistry, Research, Early Development, Respiratory

TL;DR
ZeroFold leverages pre-structural embeddings from foundation models to accurately predict protein-RNA binding affinity directly from sequences, effectively capturing conformational flexibility without requiring explicit structural data.
Contribution
This work introduces ZeroFold, a transformer-based model that uses pre-structural embeddings to predict binding affinity, addressing the challenge of RNA flexibility in structural biology.
Findings
ZeroFold achieves a Spearman correlation of 0.65 on the test set.
It outperforms existing structure-based and sequence-based predictors, especially with less similar training data.
Pre-structural embeddings effectively encode conformational ensemble information.
Abstract
The accurate prediction of protein-RNA binding affinity remains an unsolved problem in structural biology, limiting opportunities in understanding gene regulation and designing RNA-targeting therapeutics. A central obstacle is the structural flexibility of RNA, as, unlike proteins, RNA molecules exist as dynamic conformational ensembles. Thus, committing to a single predicted structure discards information relevant to binding. Here, we show that this obstacle can be addressed by extracting pre-structural embeddings, which are intermediate representations from a biomolecular foundation model captured before the structure decoding step. Pre-structural embeddings implicitly encode conformational ensemble information without requiring predicted structures. We build ZeroFold, a transformer-based model that combines pre-structural embeddings from Boltz-2 for both protein and RNA molecules…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · Protein Structure and Dynamics · Machine Learning in Bioinformatics
