Biases in In Silico Evaluation of Molecular Optimization Methods and   Bias-Reduced Evaluation Methodology

Hiroshi Kajino; Kohei Miyaguchi; Takayuki Osogami

arXiv:2201.12163·cs.LG·January 31, 2022·1 cites

Biases in In Silico Evaluation of Molecular Optimization Methods and Bias-Reduced Evaluation Methodology

Hiroshi Kajino, Kohei Miyaguchi, Takayuki Osogami

PDF

Open Access

TL;DR

This paper examines biases in in silico molecular optimization evaluation, identifies two key sources of bias, and proposes bias reduction methods, supported by empirical analysis to improve evaluation accuracy.

Contribution

It introduces a comprehensive analysis of biases in molecular optimization evaluation and proposes methods to reduce these biases for more reliable assessments.

Findings

01

Predictor misspecification causes evaluation bias.

02

Sample reuse inflates performance estimates.

03

Bias reduction methods improve evaluation reliability.

Abstract

We are interested in in silico evaluation methodology for molecular optimization methods. Given a sample of molecules and their properties of our interest, we wish not only to train an agent that can find molecules optimized with respect to the target property but also to evaluate its performance. A common practice is to train a predictor of the target property on the sample and use it for both training and evaluating the agent. We show that this evaluator potentially suffers from two biases; one is due to misspecification of the predictor and the other to reusing the same sample for training and evaluation. We discuss bias reduction methods for each of the biases comprehensively, and empirically investigate their effectiveness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Drug Discovery Methods · Analytical Chemistry and Chromatography · Various Chemistry Research Topics