Classification of signaling proteins based on molecular star graph descriptors using Machine Learning models
Carlos Fernandez-Lozano, Ruben F. Cuinas, Jose A. Seoane, Enrique, Fernandez-Blanco, Julian Dorado, Cristian R. Munteanu

TL;DR
This paper develops a machine learning model using molecular star graph descriptors to accurately classify signaling proteins, aiding drug development by predicting signaling activity from protein structures.
Contribution
It introduces a novel topological descriptor-based classification model for signaling proteins, achieving high accuracy with SVM-RFE and Laplacian kernel.
Findings
Achieved an AUROC of 0.961 with the best model.
Successfully predicted signaling activity for proteins of unknown function.
Identified important signaling pathways with high confidence.
Abstract
Signaling proteins are an important topic in drug development due to the increased importance of finding fast, accurate and cheap methods to evaluate new molecular targets involved in specific diseases. The complexity of the protein structure hinders the direct association of the signaling activity with the molecular structure. Therefore, the proposed solution involves the use of protein star graphs for the peptide sequence information encoding into specific topological indices calculated with S2SNet tool. The Quantitative Structure-Activity Relationship classification model obtained with Machine Learning techniques is able to predict new signaling peptides. The best classification model is the first signaling prediction model, which is based on eleven descriptors and it was obtained using the Support Vector Machines - Recursive Feature Elimination (SVM-RFE) technique with the Laplacian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
