Sequence-based prediction of function site and protein-ligand interaction by a functionally annotated domain profile database
Dengming Ming, Min Han, Xiongbo An

TL;DR
This paper introduces a sequence-based method utilizing a functionally annotated domain profile database, fiDPD, to predict protein functional sites and protein-ligand interactions with high accuracy, aiding in protein function annotation.
Contribution
The study presents a novel sequence-based approach for PLI and PFS prediction using fiDPD, a new database built from SCOP, improving prediction accuracy and conservation insights.
Findings
MCC of 0.66 for PFS prediction
80% recall for PLI prediction
PLIs are conserved during protein evolution
Abstract
Identifying protein functional sites (PFSs) and protein-ligand interactions (PLIs) are critically important in understanding the protein function and the involved biochemical reactions. As large amount of unknown proteins are quickly accumulated in this post-genome era, an urgent task arises to predict PFSs and PLIs at residual level. Nowadays many knowledge-based methods have been well developed for prediction of PFSs, however, accurate methods for PLI prediction are still lacking. In this study, we have presented a new method for prediction of PLIs and PFSs based on sequence of the inquiry protein. The key of the method hinges on a function- and interaction-annotated protein domain profile database, called fiDPD, which was built from the Structural Classification of Proteins (SCOP) database, using a hidden Markov model program. The method was applied to 13 target proteins from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Machine Learning in Bioinformatics · Computational Drug Discovery Methods
