# PERFUMES: pipeline to extract RNA functional motifs and exposed structures

**Authors:** Arnaud Chol, Roman Sarrazin-Gendron, Éric Lécuyer, Mathieu Blanchette, Jérôme Waldispühl

PMC · DOI: 10.1093/bioinformatics/btae056 · 2024-01-30

## TL;DR

PERFUMES is a tool that identifies RNA 3D motifs linked to functional features by analyzing RNA sequences and their structural data.

## Contribution

Introduces PERFUMES, a novel pipeline for extracting RNA 3D motifs associated with experimental measurements.

## Key findings

- PERFUMES successfully identified known and new RNA motifs in the SNRPA protein binding site.
- The pipeline uses thermodynamics analysis to interpret RNA structural predictions.
- It is effective for analyzing RNA sequences with binary experimental data.

## Abstract

Up to 75% of the human genome encodes RNAs. The function of many non-coding RNAs relies on their ability to fold into 3D structures. Specifically, nucleotides inside secondary structure loops form non-canonical base pairs that help stabilize complex local 3D structures. These RNA 3D motifs can promote specific interactions with other molecules or serve as catalytic sites.

We introduce PERFUMES, a computational pipeline to identify 3D motifs that can be associated with observable features. Given a set of RNA sequences with associated binary experimental measurements, PERFUMES searches for RNA 3D motifs using BayesPairing2 and extracts those that are over-represented in the set of positive sequences. It also conducts a thermodynamics analysis of the structural context that can support the interpretation of the predictions. We illustrate PERFUMES’ usage on the SNRPA protein binding site, for which the tool retrieved both previously known binder motifs and new ones.

PERFUMES is an open-source Python package (https://jwgitlab.cs.mcgill.ca/arnaud_chol/perfumes).

## Linked entities

- **Proteins:** SNRPA (small nuclear ribonucleoprotein polypeptide A)

## Full-text entities

- **Genes:** SNRPA (small nuclear ribonucleoprotein polypeptide A) [NCBI Gene 6626] {aka Mud1, U1-A, U1A}
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10868343/full.md

---
Source: https://tomesphere.com/paper/PMC10868343