# On the limits of inferring biophysical parameters of RBP-RNA interactions from in vitro RNA Bind’n Seq data

**Authors:** Niels Schlusser, Mihaela Zavolan, Jun Zhang, niels schlusser, Johannes Söding, niels schlusser

PMC · DOI: 10.12688/f1000research.135164.1 · 2023-06-26

## TL;DR

This paper explores the limitations of using RNA Bind’n Seq data to infer how RNA binding proteins interact with RNA in a lab setting.

## Contribution

The paper introduces a thermodynamic model and uses expectation-maximization to infer RBP binding specificities from RNA Bind’n Seq data.

## Key findings

- The model successfully reproduces known RBP specificities and predicts novel, longer binding motifs.
- The model fails to recover all known motifs, suggesting RNA Bind’n Seq data may not fully align with a single biophysical model.

## Abstract

We develop a thermodynamic model describing the binding of RNA binding proteins (RBP) to oligomers in vitro. We apply expectation-maximization to infer the specificity of RBPs, represented as position-specific weight matrices (PWMs), by maximizing the likelihood of RNA Bind’n Seq data from the ENCODE project. We demonstrate that the model can reproduce known specificities for well-studied proteins and that in some cases we predict novel, longer binding motifs. However, the model does not recover all the motifs that are in principle known, indicating that the data is not well explained by a single underlying biophysical model. Our code is publicly available.

## Linked entities

- **Proteins:** RENBP (renin binding protein)

## Full-text entities

- **Genes:** SUGP1 (SURP and G-patch domain containing 1) [NCBI Gene 57794] {aka F23858, RBP, SF4}, MBNL1 (muscleblind like splicing regulator 1) [NCBI Gene 4154] {aka EXP, MBNL}, NME2 (NME/NM23 nucleoside diphosphate kinase 2) [NCBI Gene 4831] {aka NDK2, NDKB, NDPK B, NDPK-B, NDPKB, NM23-H2}, JUN (Jun proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 3725] {aka AP-1, AP1, c-Jun, cJUN, p39}, CPEB1 (cytoplasmic polyadenylation element binding protein 1) [NCBI Gene 64506] {aka CPE-BP1, CPEB, CPEB-1, h-CPEB, hCPEB-1}, EWSR1 (EWS RNA binding protein 1) [NCBI Gene 2130] {aka EWS, EWS-FLI1}, RBP1 (retinol binding protein 1) [NCBI Gene 5947] {aka CRABP-I, CRBP, CRBP1, CRBPI, RBPC, hCRBP1}, PUM1 (pumilio RNA binding family member 1) [NCBI Gene 9698] {aka HSPUM, NEDMSF, PUMH, PUMH1, PUML1, SCA47}, ELAVL4 (ELAV like RNA binding protein 4) [NCBI Gene 1996] {aka HUD, PNEM}, FUS (FUS RNA binding protein) [NCBI Gene 2521] {aka ALS6, ETM4, FUS1, HNRNPP2, POMP75, TLS}, RBFOX3 (RNA binding fox-1 homolog 3) [NCBI Gene 146713] {aka FOX-3, FOX3, HRNBP3, NEUN}, TAF15 (TATA-box binding protein associated factor 15) [NCBI Gene 8148] {aka Npl3, RBP56, TAF2N, TAFII68}, PTBP3 (polypyrimidine tract binding protein 3) [NCBI Gene 9991] {aka ROD1}, RBFOX2 (RNA binding fox-1 homolog 2) [NCBI Gene 23543] {aka FOX2, Fox-2, HNRBP2, HRNBP2, RBM9, RTA}, JTB (jumping translocation breakpoint) [NCBI Gene 10899] {aka HJTB, HSPC222, PAR, hJT}, PTBP1 (polypyrimidine tract binding protein 1) [NCBI Gene 5725] {aka HNRNP-I, HNRNPI, HNRPI, PTB, PTB-1, PTB-T}, CAT (catalase) [NCBI Gene 847], IGF2 (insulin like growth factor 2) [NCBI Gene 3481] {aka C11orf43, GRDF, IGF-II, PP9974, SRS3}, PCBP1 (poly(rC) binding protein 1) [NCBI Gene 5093] {aka HEL-S-85, HNRPE1, HNRPX, hnRNP-E1, hnRNP-X}, HNRNPD (heterogeneous nuclear ribonucleoprotein D) [NCBI Gene 3184] {aka AUF1, AUF1A, HNRPD, P37, hnRNPD0}, HNRNPK (heterogeneous nuclear ribonucleoprotein K) [NCBI Gene 3190] {aka AUKS, CSBP, HNRPK, TUNP}, CLIP1 (CAP-Gly domain containing linker protein 1) [NCBI Gene 6249] {aka CLIP, CLIP-170, CLIP170, CYLN1, RSN}, PUF60 (poly(U) binding splicing factor 60) [NCBI Gene 22827] {aka FIR, RoBPI, SIAHBP1, VRJS}, CELF1 (CUGBP Elav-like family member 1) [NCBI Gene 10658] {aka BRUNOL2, CUG-BP, CUGBP, CUGBP1, EDEN-BP, NAB50}, HNRNPC (heterogeneous nuclear ribonucleoprotein C) [NCBI Gene 3183] {aka HNRNP, HNRPC, MRD74, SNRPC}, NOVA1 (NOVA alternative splicing regulator 1) [NCBI Gene 4857] {aka Nova-1}, RC3H1 (ring finger and CCCH-type domains 1) [NCBI Gene 149041] {aka FHL6, IMDSHY, RNF198, ROQUIN}, ESRP1 (epithelial splicing regulatory protein 1) [NCBI Gene 54845] {aka DFNB109, RBM35A, RMB35A}, HNRNPL (heterogeneous nuclear ribonucleoprotein L) [NCBI Gene 3191] {aka HNRPL, P/OKcl.14, hnRNP-L}, EIF4H (eukaryotic translation initiation factor 4H) [NCBI Gene 7458] {aka WBSCR1, WSCR1, eIF-4H}
- **Diseases:** solid tumors (MESH:D009369), malignant melanoma (MESH:D008545), Ewing's sarcoma (MESH:D012512)
- **Chemicals:** adenine (MESH:D000225), S (MESH:D013455), oligonucleotide (MESH:D009841), C (MESH:D002244), poly(T) (MESH:D011071), di-nucleotide (MESH:D015226), Poly(A) (MESH:D011061), T (MESH:D014316), poly(U) (MESH:D011072), poly(G) (MESH:D011068)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11269977/full.md

---
Source: https://tomesphere.com/paper/PMC11269977