Pharmacophore-based design by learning on voxel grids
Omar Mahmood, Pedro O. Pinheiro, Richard Bonneau, Saeed Saremi, Vishnu Sresht

TL;DR
This paper introduces VoxCap, a voxel-based generative model for pharmacophore-based drug discovery that improves scalability, diversity, and efficiency in virtual screening and de-novo molecule design.
Contribution
The paper presents VoxCap, a novel voxel captioning model for generating molecules from 3D voxel representations, addressing scalability and diversity limitations of traditional methods.
Findings
VoxCap outperforms previous methods in generating diverse de-novo hits.
It reduces computational time by orders of magnitude in fast search workflows.
VoxCap effectively combines generative design with substructure similarity search.
Abstract
Ligand-based drug discovery (LBDD) relies on making use of known binders to a protein target to find structurally diverse molecules similarly likely to bind. This process typically involves a brute force search of the known binder (query) against a molecular library using some metric of molecular similarity. One popular approach overlays the pharmacophore-shape profile of the known binder to 3D conformations enumerated for each of the library molecules, computes overlaps, and picks a set of diverse library molecules with high overlaps. While this virtual screening workflow has had considerable success in hit diversification, scaffold hopping, and patent busting, it scales poorly with library sizes and restricts candidate generation to existing library compounds. Leveraging recent advances in voxel-based generative modelling, we propose a pharmacophore-based generative model and…
Peer Reviews
Decision·Submitted to ICLR 2025
There isn't obvious strength. Weaknesses about the originality and significance are listed below.
* In the related work section, the authors pointed out some related work about voxel-based molecule generation. However, the authors didn’t discuss the difference between their proposed method and related work. * From my opinion, the proposed method has very limited contribution, where using 3D-CNN and LSTM as encoder and decoder to generate SMILES from molecular voxels has already well studied * The figure 1 is unclear. Please use higher resolution figures and distinct visual markers for legen
Novel combination of generative design and 2D substructure similarity search Innovative voxel captioning approach for SMILES generation Creative solution to address traditional pharmacophore screening limitations
Conceptual Framework and Definitions: A concern lies in the manuscript's handling of core concepts. The authors' definition of a pharmacophore requires significant refinement. A pharmacophore isn't simply a "feature of a molecule" but rather a collection of essential features arranged in a specific 3D pattern. The current definition using "may be involved in binding" is too tentative - pharmacophore features are understood to be essential for binding. Furthermore, the statement that "Drug discov
- This work builds upon 3D ligand-based virtual screening where the proprietary 3D shape matching software, ROCS, is commonly deployed and as such is highly relevant to practical drug discovery. - The proposed method, VoxCap, generates molecules with substantial 3D shape and pharmacophoric similarity to query molecules surpassing the baseline PGMG in generating 'hits', 'unique scaffold hits' and maximising the shape similarity score to the query.
Major: 1. Experimental results could be strengthened: - De novo design - PGMG is a relevant baseline, however data processing differs between VoxCap and PGMG making it difficult to interpret which part of the workflow contributes to difference in performance. The authors could investigate a hybrid approach method which matches the conformer generation and voxelization approach. Or in general provide an ablation study that shows why VoxCap is better than other methods. - Comparison wit
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Cell Image Analysis Techniques · Protein Structure and Dynamics
