SCOP: A Sequence-Structure Contrast-Aware Framework for Protein Function   Prediction

Runze Ma; Chengxin He; Huiru Zheng; Xinye Wang; Haiying Wang; Yidan; Zhang; Lei Duan

arXiv:2411.11366·q-bio.BM·November 19, 2024·BIBM

SCOP: A Sequence-Structure Contrast-Aware Framework for Protein Function Prediction

Runze Ma, Chengxin He, Huiru Zheng, Xinye Wang, Haiying Wang, Yidan, Zhang, Lei Duan

PDF

Open Access 1 Repo

TL;DR

SCOP is a contrast-aware pre-training framework that integrates protein sequence and structure information to improve function prediction, achieving better results with less data.

Contribution

It introduces a novel contrast-aware pre-training framework that combines sequence and structure views for enhanced protein function prediction.

Findings

01

Outperforms existing methods on multiple datasets.

02

Requires less pre-training data for effective results.

03

Effectively integrates sequence and structural information.

Abstract

Improving the ability to predict protein function can potentially facilitate research in the fields of drug discovery and precision medicine. Technically, the properties of proteins are directly or indirectly reflected in their sequence and structure information, especially as the protein function is largely determined by its spatial properties. Existing approaches mostly focus on protein sequences or topological structures, while rarely exploiting the spatial properties and ignoring the relevance between sequence and structure information. Moreover, obtaining annotated data to improve protein function prediction is often time-consuming and costly. To this end, this work proposes a novel contrast-aware pre-training framework, called SCOP, for protein function prediction. We first design a simple yet effective encoder to integrate the protein topological and spatial features under the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mrzzmrzz/scop
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · Bioinformatics and Genomic Networks

MethodsFocus