SCOP: A Sequence-Structure Contrast-Aware Framework for Protein Function Prediction
Runze Ma, Chengxin He, Huiru Zheng, Xinye Wang, Haiying Wang, Yidan, Zhang, Lei Duan

TL;DR
SCOP is a contrast-aware pre-training framework that integrates protein sequence and structure information to improve function prediction, achieving better results with less data.
Contribution
It introduces a novel contrast-aware pre-training framework that combines sequence and structure views for enhanced protein function prediction.
Findings
Outperforms existing methods on multiple datasets.
Requires less pre-training data for effective results.
Effectively integrates sequence and structural information.
Abstract
Improving the ability to predict protein function can potentially facilitate research in the fields of drug discovery and precision medicine. Technically, the properties of proteins are directly or indirectly reflected in their sequence and structure information, especially as the protein function is largely determined by its spatial properties. Existing approaches mostly focus on protein sequences or topological structures, while rarely exploiting the spatial properties and ignoring the relevance between sequence and structure information. Moreover, obtaining annotated data to improve protein function prediction is often time-consuming and costly. To this end, this work proposes a novel contrast-aware pre-training framework, called SCOP, for protein function prediction. We first design a simple yet effective encoder to integrate the protein topological and spatial features under the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · Bioinformatics and Genomic Networks
MethodsFocus
