Interpretable Convolutional Neural Networks for Effective Translation Initiation Site Prediction
Jasper Zuallaert, Mijung Kim, Yvan Saeys, Wesley De Neve

TL;DR
This paper introduces an interpretable convolutional neural network model that significantly improves the accuracy of predicting translation initiation sites in genomic data, learning biologically relevant features without prior knowledge.
Contribution
The novel CNN approach enhances prediction accuracy and provides insights into biologically meaningful features learned implicitly by the model.
Findings
75.2% reduction in false positive rate
24.5% decrease in error rate
Model learns biologically relevant features without prior knowledge
Abstract
Thanks to rapidly evolving sequencing techniques, the amount of genomic data at our disposal is growing increasingly large. Determining the gene structure is a fundamental requirement to effectively interpret gene function and regulation. An important part in that determination process is the identification of translation initiation sites. In this paper, we propose a novel approach for automatic prediction of translation initiation sites, leveraging convolutional neural networks that allow for automatic feature extraction. Our experimental results demonstrate that we are able to improve the state-of-the-art approaches with a decrease of 75.2% in false positive rate and with a decrease of 24.5% in error rate on chosen datasets. Furthermore, an in-depth analysis of the decision-making process used by our predictive model shows that our neural network implicitly learns biologically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
