Neural Embeddings for Protein Graphs
Francesco Ceccarelli, Lorenzo Giusti, Sean B. Holden, Pietro Li\`o

TL;DR
This paper introduces a novel framework combining GNNs and LLMs to embed protein graphs into geometric spaces, enabling efficient structure comparison and classification, with significant speed and accuracy improvements over traditional methods.
Contribution
The paper presents a new method for embedding protein graphs that preserves structural distances, integrating sequence and structure information using GNNs and LLMs.
Findings
Achieved a 26% average F1-Score improvement on OOD samples.
Demonstrated significant speed-up over structural alignment methods.
Effective in protein structure classification and related applications.
Abstract
Proteins perform much of the work in living organisms, and consequently the development of efficient computational methods for protein representation is essential for advancing large-scale biological research. Most current approaches struggle to efficiently integrate the wealth of information contained in the protein sequence and structure. In this paper, we propose a novel framework for embedding protein graphs in geometric vector spaces, by learning an encoder function that preserves the structural distance between protein graphs. Utilizing Graph Neural Networks (GNNs) and Large Language Models (LLMs), the proposed framework generates structure- and sequence-aware protein representations. We demonstrate that our embeddings are successful in the task of comparing protein structures, while providing a significant speed-up compared to traditional approaches based on structural alignment.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Machine Learning in Bioinformatics · Protein Structure and Dynamics
