Long non-coding RNAs as a source of new peptides
Jorge Ruiz-Orera, Xavier Messeguer, Juan A. Subirana, M.Mar Alb\`a

TL;DR
This study reveals that many long non-coding RNAs contain translated open reading frames, suggesting they may serve as a source for new peptides and play a role in de novo protein evolution across multiple species.
Contribution
It provides evidence that lncRNAs can encode small peptides, highlighting their potential role in the emergence of new proteins, which was previously underappreciated.
Findings
Ribosome profiling shows translated ORFs in many lncRNAs.
Most ORFs are shorter than 100 amino acids.
Ribosome density patterns resemble those of protein-coding genes.
Abstract
Deep transcriptome sequencing has revealed the existence of many transcripts that lack long or conserved open reading frames and which have been termed long non-coding RNAs (lncRNAs). Despite the existence of several well-characterized lncRNAs that play roles in the regulation of gene expression, the vast majority of them do not yet have a known function. Motivated by the existence of ribosome profiling data for several species, we have tested the hypothesis that they may act as a repository for the synthesis of new peptides using data from human, mouse, zebrafish, fruit fly, Arabidopsis and yeast. The ribosome protection patterns are consistent with the presence of translated open reading frames (ORFs) in a very large number of lncRNAs. Most of the ribosome-protected ORFs are shorter than 100 amino acids and usually cover less than half the transcript. Ribosome density in these ORFs is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
