Pi-SAGE: Permutation-invariant surface-aware graph encoder for binding affinity prediction
Sharmi Banerjee, Mostafa Karimi, Melih Yilmaz, Tommi Jaakkola, Bella Dubrov, Shang Shang, Ron Benson

TL;DR
Pi-SAGE introduces a permutation-invariant, surface-aware graph encoder that explicitly encodes protein surface features, significantly improving the prediction of binding affinity changes in protein complexes.
Contribution
The paper presents Pi-SAGE, a novel graph encoder that explicitly encodes protein surface features and enhances structure-based binding affinity prediction models.
Findings
Explicit surface feature encoding improves prediction accuracy.
Augmenting existing models with Pi-SAGE features enhances performance.
Surface-aware encoding outperforms implicit surface learning methods.
Abstract
Protein surface fingerprint encodes chemical and geometric features that govern protein-protein interactions and can be used to predict changes in binding affinity between two protein complexes. Current state-of-the-art models for predicting binding affinity change, such as GearBind, are all-atom based geometric models derived from protein structures. Although surface properties can be implicitly learned from the protein structure, we hypothesize that explicit knowledge of protein surfaces can improve a structure-based model's ability to predict changes in binding affinity. To this end, we introduce Pi-SAGE, a novel Permutation-Invariant Surface-Aware Graph Encoder. We first train Pi-SAGE to create a protein surface codebook directly from the structure and assign a token for each surface-exposed residue. Next, we augment the node features of the GearBind model with surface features from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Computational Drug Discovery Methods · Bioinformatics and Genomic Networks
