# cncFinder: A graph-attention-network-based interpretable learning model to identify bifunctional long non-coding RNAs

**Authors:** Qiang Tang, Yang Yu, Min Shen, Lin Zhang, Xu Jia, Juanjuan Kang

PMC · DOI: 10.1016/j.omtn.2025.102812 · Molecular Therapy. Nucleic Acids · 2025-12-26

## TL;DR

cncFinder is a machine learning model that accurately identifies bifunctional long non-coding RNAs using graph-based sequence analysis.

## Contribution

cncFinder introduces a novel graph-attention-network-based approach for predicting bifunctional lncRNAs with high accuracy and interpretability.

## Key findings

- cncFinder outperformed state-of-the-art models in predicting bifunctional lncRNAs.
- The model captured biologically meaningful motifs like start codons and Kozak-like elements.

## Abstract

Certain RNAs exhibit both protein-coding and regulatory non-coding functions, termed bifunctional RNAs or coding and non-coding RNAs. Long non-coding RNAs (lncRNAs), which play crucial roles in gene regulation and cellular processes, represent a major subset of bifunctional RNAs. Accurate identification of bifunctional lncRNAs is critical for advancing RNA biology and uncovering opportunities for biomarker discovery and therapeutic development. Here, we present cncFinder, a graph-attention-network-based model for predicting bifunctional lncRNAs. It transforms lncRNA sequences into k-mer graphs, encodes node features with Word2Vec, and employs graph attention network to capture higher-order sequence dependencies. On the testing dataset, cncFinder achieved superior performance, significantly outperforming state-of-the-art models. Its robustness and broad applicability were further confirmed through validation on cross-species datasets from mouse and fruit fly. Interpretability analysis revealed that cncFinder captured biologically meaningful motifs, including canonical start codons and Kozak-like elements. In a case study of LINC00961, cncFinder precisely detected an experimentally validated translation initiation motif, highlighting its biological relevance. To support broad accessibility, we developed a user-friendly web server. In summary, cncFinder advances predictive accuracy and interpretability, providing a powerful tool for systematic discovery of bifunctional lncRNAs and enabling new insights into RNA multifunctionality.

cncFinder is a graph-attention-network-based framework that transforms lncRNA sequences into k-mer graphs to predict bifunctional lncRNAs with high accuracy and biological interpretability, enabling systematic discovery of bifunctional lncRNAs.

## Linked entities

- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Species:** Drosophila melanogaster (fruit fly, species) [taxon 7227], Mus musculus (house mouse, species) [taxon 10090]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12830212/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12830212/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/PMC12830212/full.md

---
Source: https://tomesphere.com/paper/PMC12830212