A Brief Review of Machine Learning Techniques for Protein Phosphorylation Sites Prediction
Farzaneh Esmaili, Mahdi Pourmirzaei, Shahin Ramazi, Elham Yavari

TL;DR
This paper reviews machine learning techniques for predicting protein phosphorylation sites, covering databases, methods, feature extraction, and evaluates online tools showing their limited performance on new data.
Contribution
It provides a comprehensive overview of phosphorylation site prediction methods, datasets, feature extraction techniques, and evaluates the performance of online tools on recent proteins.
Findings
Online tools perform poorly on new phosphorylation data.
Two main ML approaches: conventional and End-to-End learning.
Reviewed databases and feature extraction techniques for p-sites prediction.
Abstract
Post-translational modifications (PTMs) have vital roles in extending the functional diversity of proteins and as a result, regulating diverse cellular processes in prokaryotic and eukaryotic organisms. Phosphorylation modification is a vital PTM that occurs in most proteins and plays significant roles in many biological processes. Disorders in the phosphorylation process lead to multiple diseases including neurological disorders and cancers. At first, this study comprehensively reviewed all databases related to phosphorylation sites (p-sites). Secondly, we introduced all steps regarding dataset creation, data preprocessing and method evaluation in p-sites prediction. Next, we investigated p-sites prediction methods which fall into two computational and Machine Learning (ML) groups. Additionally, it was shown that there are basically two main approaches for p-sites prediction by ML:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Genomics and Phylogenetic Studies · RNA and protein synthesis mechanisms
