# Sequence-Based Models for RNA–Protein Interactions Imputation Might Be Insufficient for Novel Signal Prediction in eCLIP Data

**Authors:** Arsenii K. Rybakov, Daniil A. Khlebnikov, Daria Y. Ovchinnikova, Arina I. Nikolskaya, Arsenii O. Zinkevich, Andrey A. Mironov

PMC · DOI: 10.3390/ijms27031192 · International Journal of Molecular Sciences · 2026-01-24

## TL;DR

This paper introduces a machine learning framework for predicting RNA–protein interactions but finds it insufficient for in vivo data due to experimental biases.

## Contribution

The PLERIO framework uses eCLIP data to reconstruct RNA–protein interactions and is extended to 220 proteins for de novo predictions.

## Key findings

- The PLERIO framework can reconstruct interactions for a single protein using eCLIP data.
- Extrapolating the framework to multiple proteins shows limitations in in vivo IP data.
- The approach may be more suitable for in vitro experiments like RNAcompete.

## Abstract

Predicting specific RNA–protein interactions remains a challenging task. Despite the existence of numerous methods, a unified approach has yet to emerge. Additional difficulties emerge from the properties of in vivo IP experiments and their systematic biases, such as the overrepresentation of highly expressed RNAs. Here, we present the PLERIO machine learning framework, which utilizes eCLIP data for a single protein to reconstruct the full spectrum of its potential interactions with the cellular transcriptome (i.e., both highly expressed and lowly expressed RNAs). In an effort to extrapolate our methodology to a multi-protein paradigm for de novo prediction of RNA–protein interactions on proteins lacking available eCLIP data, we extended our approach to 220 cellular proteins. We then demonstrate that this approach might not be well tailored to the limitations of current in vivo immunoprecipitation data, and may only be meaningful for in vitro experiments such as RNAcompete.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12898063/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12898063/full.md

## References

63 references — full list in the complete paper: https://tomesphere.com/paper/PMC12898063/full.md

---
Source: https://tomesphere.com/paper/PMC12898063