Efficient Prediction of DNA-Binding Proteins Using Machine Learning
Sokyna Qatawneh, Afaf Alneaimi, Thamer Rawashdeh, Mmohammad Muhairat,, Rami Qahwaji, Stan Ipson

TL;DR
This study develops and compares machine learning models, specifically SVM and neural networks, to predict DNA-binding proteins based on amino acid characteristics, achieving high accuracy with SVM.
Contribution
The paper introduces optimized machine learning models for DNA-binding protein prediction and compares their performance, highlighting the effectiveness of SVM with ANOVA Kernel.
Findings
SVM achieved 86.7% accuracy with high sensitivity and specificity.
Neural network achieved 75.4% accuracy, lower than SVM.
Features like charge, patch size, and amino acid composition are effective for prediction.
Abstract
DNA-binding proteins are a class of proteins which have a specific or general affinity to DNA and include three important components: transcription factors; nucleases, and histones. DNA-binding proteins also perform important roles in many types of cellular activities. In this paper we describe machine learning systems for the prediction of DNA- binding proteins where a Support Vector Machine and a Cascade Correlation Neural Network are optimized and then compared to determine the learning algorithm that achieves the best prediction performance. The information used for classification is derived from characteristics that include overall charge, patch size and amino acids composition. In total 121 DNA- binding proteins and 238 non-binding proteins are used to build and evaluate the system. For SVM using the ANOVA Kernel with Jack-knife evaluation, an accuracy of 86.7% has been achieved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · RNA and protein synthesis mechanisms · Protein Structure and Dynamics
