PromID: human promoter prediction by deep learning
Ramzan Umarov, Hiroyuki Kuwahara, Yu Li, Xin Gao, Victor Solovyev

TL;DR
PromID is a deep learning-based tool that accurately predicts the exact positions of transcription start sites in human genomic sequences, significantly reducing false positives compared to previous methods.
Contribution
This work advances promoter prediction by developing deep learning models that precisely locate TSS positions and iteratively improve discrimination using adaptive negative sets.
Findings
Achieved recall of 0.76, precision of 0.77, MCC of 0.76
Outperformed previous tools like FPROM in accuracy and false positive reduction
Provided a publicly available tool for human promoter prediction
Abstract
Computational identification of promoters is notoriously difficult as human genes often have unique promoter sequences that provide regulation of transcription and interaction with transcription initiation complex. While there are many attempts to develop computational promoter identification methods, we have no reliable tool to analyze long genomic sequences. In this work we further develop our deep learning approach that was relatively successful to discriminate short promoter and non-promoter sequences. Instead of focusing on the classification accuracy, in this work we predict the exact positions of the TSS inside the genomic sequences testing every possible location. We studied human promoters to find effective regions for discrimination and built corresponding deep learning models. These models use adaptively constructed negative set which iteratively improves the models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Chromatin Dynamics · Machine Learning in Bioinformatics · RNA and protein synthesis mechanisms
