What Ails Generative Structure-based Drug Design: Expressivity is Too Little or Too Much?
Rafa{\l} Karczewski, Samuel Kaski, Markus Heinonen, Vikas Garg

TL;DR
This paper investigates why current generative models for structure-based drug design underperform, analyzing their expressivity and proposing a simpler, more efficient model that achieves state-of-the-art results with fewer parameters.
Contribution
It provides the first theoretical analysis of GNN limitations in protein-ligand modeling and introduces a simple, parameter-efficient model that outperforms existing methods.
Findings
GNNs have representational limitations in protein-ligand complexes.
A simple metric-aware model achieves state-of-the-art results.
The proposed model is 100x smaller and 1000x faster than previous approaches.
Abstract
Several generative models with elaborate training and sampling procedures have been proposed to accelerate structure-based drug design (SBDD); however, their empirical performance turns out to be suboptimal. We seek to better understand this phenomenon from both theoretical and empirical perspectives. Since most of these models apply graph neural networks (GNNs), one may suspect that they inherit the representational limitations of GNNs. We analyze this aspect, establishing the first such results for protein-ligand complexes. A plausible counterview may attribute the underperformance of these models to their excessive parameterizations, inducing expressivity at the expense of generalization. We investigate this possibility with a simple metric-aware approach that learns an economical surrogate for affinity to infer an unlabelled molecular graph and optimizes for labels conditioned on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Chemical Synthesis and Analysis
