# Network Representation of Large-Scale Heterogeneous RNA Sequences with   Integration of Diverse Multi-omics, Interactions, and Annotations Data

**Authors:** Nhat Tran, Jean Gao

arXiv: 1906.07289 · 2020-12-10

## TL;DR

This paper introduces rna2rna, a deep learning framework that creates low-dimensional embeddings of RNA sequences to predict interactions, distinguish interaction directionality, and integrate diverse biological datasets, improving RNA interaction prediction accuracy.

## Contribution

The novel rna2rna framework enables inductive, directed, and multi-omics integrated RNA interaction predictions, surpassing existing methods in accuracy and functional annotation discovery.

## Key findings

- Outperforms state-of-the-art in interaction prediction accuracy.
- Successfully predicts interactions for unseen RNA sequences.
- Captures functional and interaction manifolds for RNA sequences.

## Abstract

Long non-coding RNA, microRNA, and messenger RNA enable key regulations of various biological processes through a variety of diverse interaction mechanisms. Identifying the interactions and cross-talk between these heterogeneous RNA classes is essential in order to uncover the functional role of individual RNA transcripts, especially for unannotated and newly-discovered RNA sequences with no known interactions. Recently, sequence-based deep learning and network embedding methods are becoming promising approaches that can either predict RNA-RNA interactions from a sequence or infer missing interactions from patterns that may exist in the network topology. However, the majority of these methods have several limitations, eg, the inability to perform inductive predictions, to distinguish the directionality of interactions, or to integrate various sequence, interaction, and annotation biological datasets. We proposed a novel deep learning-based framework, rna2rna, which learns from RNA sequences to produce a low-dimensional embedding that preserves the proximities in both the interactions topology and the functional affinity topology. In this proposed embedding space, we have designated a two-part" source and target contexts" to capture the targeting and receptive fields of each RNA transcript, while encapsulating the heterogenous cross-talk interactions between lncRNAs and miRNAs. From experimental results, our method exhibits superior performance in AUPR rates compared to state-of-art approaches at predicting missing interactions in different RNA-RNA interaction databases and was shown to accurately perform link predictions to novel RNA sequences not seen at training time, even without any prior information. Additional results suggest that our proposed framework can capture a manifold for heterogeneous RNA sequences to discover novel functional annotations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.07289/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/1906.07289/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/1906.07289/full.md

---
Source: https://tomesphere.com/paper/1906.07289