Biases in In Silico Evaluation of Molecular Optimization Methods and Bias-Reduced Evaluation Methodology
Hiroshi Kajino, Kohei Miyaguchi, Takayuki Osogami

TL;DR
This paper examines biases in in silico molecular optimization evaluation, identifies two key sources of bias, and proposes bias reduction methods, supported by empirical analysis to improve evaluation accuracy.
Contribution
It introduces a comprehensive analysis of biases in molecular optimization evaluation and proposes methods to reduce these biases for more reliable assessments.
Findings
Predictor misspecification causes evaluation bias.
Sample reuse inflates performance estimates.
Bias reduction methods improve evaluation reliability.
Abstract
We are interested in in silico evaluation methodology for molecular optimization methods. Given a sample of molecules and their properties of our interest, we wish not only to train an agent that can find molecules optimized with respect to the target property but also to evaluate its performance. A common practice is to train a predictor of the target property on the sample and use it for both training and evaluating the agent. We show that this evaluator potentially suffers from two biases; one is due to misspecification of the predictor and the other to reusing the same sample for training and evaluation. We discuss bias reduction methods for each of the biases comprehensively, and empirically investigate their effectiveness.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Analytical Chemistry and Chromatography · Various Chemistry Research Topics
