Exploring QSAR Models for Activity-Cliff Prediction
Markus Dablander, Thierry Hanser, Renaud Lambiotte, Garrett M. Morris

TL;DR
This study evaluates the ability of various QSAR models to predict activity cliffs, revealing their limitations and proposing potential improvements for better activity cliff prediction in drug discovery.
Contribution
The paper systematically compares multiple QSAR models and representations for activity cliff prediction, highlighting the strengths of graph isomorphism features and proposing twin-network training for improvement.
Findings
Low activity cliff sensitivity when compound activities are unknown
Graph isomorphism features outperform classical representations in AC classification
QSAR methods often fail to predict activity cliffs accurately
Abstract
Pairs of similar compounds that only differ by a small structural modification but exhibit a large difference in their binding affinity for a given target are known as activity cliffs (ACs). It has been hypothesised that quantitative structure-activity relationship (QSAR) models struggle to predict ACs and that ACs thus form a major source of prediction error. However, a study to explore the AC-prediction power of modern QSAR methods and its relationship to general QSAR-prediction performance is lacking. We systematically construct nine distinct QSAR models by combining three molecular representation methods (extended-connectivity fingerprints, physicochemical-descriptor vectors and graph isomorphism networks) with three regression techniques (random forests, k-nearest neighbours and multilayer perceptrons); we then use each resulting model to classify pairs of similar compounds as ACs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Cholinesterase and Neurodegenerative Diseases · Machine Learning in Materials Science
Methodsfail
