Prediction of Prokaryotic and Eukaryotic Promoters Using Convolutional Deep Learning Neural Networks
Victor Solovyev, Ramzan Umarov

TL;DR
This paper demonstrates that convolutional neural networks can accurately identify prokaryotic and eukaryotic promoters across diverse organisms, outperforming previous methods and requiring no prior knowledge of specific promoter features.
Contribution
The study introduces CNN-based models for promoter prediction in various organisms, achieving high accuracy and generalizability without relying on predefined promoter features.
Findings
CNN models achieved high sensitivity and specificity in promoter classification.
The models outperformed previous promoter prediction tools.
The approach is adaptable to newly sequenced genomes.
Abstract
Accurate computational identification of promoters remains a challenge as these key DNA regulatory regions have variable structures composed of functional motifs that provide gene specific initiation of transcription. In this paper we utilize Convolutional Neural Networks (CNN) to analyze sequence characteristics of prokaryotic and eukaryotic promoters and build their predictive models. We trained the same CNN architecture on promoters of four very distant organisms: human, plant (Arabidopsis), and two bacteria (Escherichia coli and Mycoplasma pneumonia). We found that CNN trained on sigma70 subclass of Escherichia coli promoter gives an excellent classification of promoters and non-promoter sequences (Sn=0.90, Sp=0.96, CC=0.84). The Bacillus subtilis promoters identification CNN model achieves Sn=0.91, Sp=0.95, and CC=0.86. For human and Arabidopsis promoters we employ CNNs for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · RNA and protein synthesis mechanisms · Genomics and Phylogenetic Studies
