# Knotify_V2.0: Deciphering RNA Secondary Structures with H-Type Pseudoknots and Hairpin Loops

**Authors:** Angelos Kolaitis, Evangelos Makris, Alexandros Anastasios Karagiannis, Panayiotis Tsanakas, Christos Pavlatos

PMC · DOI: 10.3390/genes15060670 · Genes · 2024-05-23

## TL;DR

Knotify_V2.0 is a new framework for predicting RNA structures, especially H-type pseudoknots and hairpin loops, with improved accuracy over previous methods.

## Contribution

The novel contribution is the ability to identify bulges and hairpins within pseudoknot internal loops using a Context-Free Grammar-based approach.

## Key findings

- Knotify_V2.0 accurately identified core base pairings in 70% of pseudoknot sequences.
- It outperformed existing tools in true positives and reduced false negatives, improving Recall and F1-score.
- The framework narrowed the performance gap with Knotty and achieved the highest F1-score.

## Abstract

Accurately predicting the pairing order of bases in RNA molecules is essential for anticipating RNA secondary structures. Consequently, this task holds significant importance in unveiling previously unknown biological processes. The urgent need to comprehend RNA structures has been accentuated by the unprecedented impact of the widespread COVID-19 pandemic. This paper presents a framework, Knotify_V2.0, which makes use of syntactic pattern recognition techniques in order to predict RNA structures, with a specific emphasis on tackling the demanding task of predicting H-type pseudoknots that encompass bulges and hairpins. By leveraging the expressive capabilities of a Context-Free Grammar (CFG), the suggested framework integrates the inherent benefits of CFG and makes use of minimum free energy and maximum base pairing criteria. This integration enables the effective management of this inherently ambiguous task. The main contribution of Knotify_V2.0 compared to earlier versions lies in its capacity to identify additional motifs like bulges and hairpins within the internal loops of the pseudoknot. Notably, the proposed methodology, Knotify_V2.0, demonstrates superior accuracy in predicting core stems compared to state-of-the-art frameworks. Knotify_V2.0 exhibited exceptional performance by accurately identifying both core base pairing that form the ground truth pseudoknot in 70% of the examined sequences. Furthermore, Knotify_V2.0 narrowed the performance gap with Knotty, which had demonstrated better performance than Knotify and even surpassed it in Recall and F1-score metrics. Knotify_V2.0 achieved a higher count of true positives (tp) and a significantly lower count of false negatives (fn) compared to Knotify, highlighting improvements in Prediction and Recall metrics, respectively. Consequently, Knotify_V2.0 achieved a higher F1-score than any other platform. The source code and comprehensive implementation details of Knotify_V2.0 are publicly available on GitHub.

## Linked entities

- **Diseases:** COVID-19 (MONDO:0100096)

## Full-text entities

- **Diseases:** COVID-19 (MESH:D000086382)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11203014/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11203014/full.md

## References

73 references — full list in the complete paper: https://tomesphere.com/paper/PMC11203014/full.md

---
Source: https://tomesphere.com/paper/PMC11203014