GLProtein: Global-and-Local Structure Aware Protein Representation Learning
Yunqing Liu, Wenqi Fan, Xiaoyong Wei, Qing Li

TL;DR
GLProtein is a novel protein pre-training framework that integrates global structural similarity and local amino acid information, significantly improving bioinformatics prediction tasks.
Contribution
It introduces the first approach combining global and local structural information in protein representation learning.
Findings
Outperforms previous methods in protein-protein interaction prediction
Enhances accuracy in contact prediction tasks
Demonstrates the effectiveness of combined global-local structural encoding
Abstract
Proteins are central to biological systems, participating as building blocks across all forms of life. Despite advancements in understanding protein functions through protein sequence analysis, there remains potential for further exploration in integrating protein structural information. We argue that the structural information of proteins is not only limited to their 3D information but also encompasses information from amino acid molecules (local information) to protein-protein structure similarity (global information). To address this, we propose \textbf{GLProtein}, the first framework in protein pre-training that incorporates both global structural similarity and local amino acid details to enhance prediction accuracy and functional insights. GLProtein innovatively combines protein-masked modelling with triplet structure similarity scoring, protein 3D distance encoding and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsProtein Structure and Dynamics · Machine Learning in Bioinformatics · Bioinformatics and Genomic Networks
