Edge-aware GAT-based protein binding site prediction
Weisen Yang, Hanqing Zhang, Wangren Qiu, Xuan Xiao, Weizhong Lin

TL;DR
This paper introduces an Edge-aware Graph Attention Network that leverages structural and spatial features to accurately predict biomolecular binding sites, demonstrating superior performance and interpretability on benchmark datasets.
Contribution
The study presents a novel edge-aware GAT model incorporating multidimensional structural features and directional information for improved binding site prediction.
Findings
Achieves ROC-AUC of 0.93 on protein-protein binding site datasets.
Outperforms several state-of-the-art methods in accuracy.
Provides a publicly accessible web server for practical application.
Abstract
Accurate identification of protein binding sites is crucial for understanding biomolecular interaction mechanisms and for the rational design of drug targets. Traditional predictive methods often struggle to balance prediction accuracy with computational efficiency when capturing complex spatial conformations. To address this challenge, we propose an Edge-aware Graph Attention Network (Edge-aware GAT) model for the fine-grained prediction of binding sites across various biomolecules, including proteins, DNA/RNA, ions, ligands, and lipids. Our method constructs atom-level graphs and integrates multidimensional structural features, including geometric descriptors, DSSP-derived secondary structure, and relative solvent accessibility (RSA), to generate spatially aware embedding vectors. By incorporating interatomic distances and directional vectors as edge features within the attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Computational Drug Discovery Methods · Machine Learning in Bioinformatics
