Automating reward function configuration for drug design
Marius Urbonas, Temitope Ajileye, Paul Gainer, Douglas Pires

TL;DR
This paper introduces an automated, data-driven method for configuring reward functions in generative molecular design, improving efficiency and accuracy over manual approaches in AI-driven drug discovery.
Contribution
The authors propose a novel neural network-based approach that constructs reward functions from experimental data, outperforming human-designed functions in drug discovery tasks.
Findings
The method adapts over time to produce high-scoring compounds.
It outperforms human-defined reward functions in predictive accuracy.
Achieves up to 0.4 improvement in Spearman's correlation.
Abstract
Designing reward functions that guide generative molecular design (GMD) algorithms to desirable areas of chemical space is of critical importance in AI-driven drug discovery. Traditionally, this has been a manual and error-prone task; the selection of appropriate computational methods to approximate biological assays is challenging and the aggregation of computed values into a single score even more so, leading to potential reliance on trial-and-error approaches. We propose a novel approach for automated reward configuration that relies solely on experimental data, mitigating the challenges of manual reward adjustment on drug discovery projects. Our method achieves this by constructing a ranking over experimental data based on Pareto dominance over the multi-objective space, then training a neural network to approximate the reward function such that rankings determined by the predicted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science
