CircFormerMoE: An End-to-End Deep Learning Framework for Circular RNA Splice Site Detection and Pairing in Plant Genomes
Tianyou Jiang

TL;DR
CircFormerMoE is a deep learning framework using transformers and mixture-of-experts to predict plant circRNAs directly from genomic DNA, overcoming limitations of existing methods and enabling large-scale discovery.
Contribution
The paper introduces CircFormerMoE, a novel transformer-based deep learning model for plant circRNA prediction directly from DNA sequences, with validated effectiveness across multiple species.
Findings
Accurately predicts plant circRNAs from genomic DNA.
Capable of discovering unannotated circRNAs.
Validated on 10 plant species.
Abstract
Circular RNAs (circRNAs) are important components of the non-coding RNA regulatory network. Previous circRNA identification primarily relies on high-throughput RNA sequencing (RNA-seq) data combined with alignment-based algorithms that detect back-splicing signals. However, these methods face several limitations: they can't predict circRNAs directly from genomic DNA sequences and relies heavily on RNA experimental data; they involve high computational costs due to complex alignment and filtering steps; and they are inefficient for large-scale or genome-wide circRNA prediction. The challenge is even greater in plants, where plant circRNA splice sites often lack the canonical GT-AG motif seen in human mRNA splicing, and no efficient deep learning model with strong generalization capability currently exists. Furthermore, the number of currently identified plant circRNAs is likely far lower…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
