Structure-Aware Compound-Protein Affinity Prediction via Graph Neural Network with Group Lasso Regularization
Zanyu Shi, Yang Wang, Pathum Weerawarna, Jie Zhang, Timothy Richardson, Yijie Wang, Kun Huang

TL;DR
This paper introduces a graph neural network framework with group lasso regularization for predicting compound-protein affinity, especially focusing on activity cliffs, improving both accuracy and interpretability in drug discovery.
Contribution
The study presents a novel GNN-based approach with structure-aware loss functions and regularizations to enhance prediction accuracy and explainability for activity cliff data in drug discovery.
Findings
Improved affinity prediction accuracy with reduced RMSE and higher PCC.
Enhanced feature attribution and interpretability of molecular substructures.
Effective identification of critical molecular substructures influencing activity.
Abstract
Explainable artificial intelligence (XAI) approaches have been increasingly applied in drug discovery to learn molecular representations and identify substructures driving property predictions. However, building end-to-end explainable models for structure-activity relationship (SAR) modeling for compound property prediction faces many challenges, such as the limited number of compound-protein interaction activity data for specific protein targets, and plenty of subtle changes in molecular configuration sites significantly affecting molecular properties. We exploit pairs of molecules with activity cliffs that share scaffolds but differ at substituent sites, characterized by large potency differences for specific protein targets. We propose a framework by implementing graph neural networks (GNNs) to leverage property and structure information from activity cliff pairs to predict…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Protein Structure and Dynamics · Genetics, Bioinformatics, and Biomedical Research
