A reproducibility study of "Augmenting Genetic Algorithms with Deep Neural Networks for Exploring the Chemical Space"
Kevin Maik Jablonka, Fergus Mcilwaine, Susana Garcia, Berend Smit,, Brian Yoo

TL;DR
This study critically examines the reproducibility of Nigam et al.'s genetic algorithm for molecule generation, confirming some claims but highlighting issues with fitness function exploitation and proposing improvements.
Contribution
It reproduces key results of the original GA, analyzes its limitations, and suggests enhancements to the adaptive penalty mechanism for better diversity.
Findings
Reproduced the GA results using SELFIES representation.
Identified that fitness function deficiencies can be exploited.
Proposed improvements to the adaptive penalty for diversity.
Abstract
Nigam et al. reported a genetic algorithm (GA) utilizing the SELFIES representation and also propose an adaptive, neural network-based penalty that is supposed to improve the diversity of the generated molecules. The main claims of the paper are that this GA outperforms other generative techniques (as measured by the penalized logP) and that a neural network-based adaptive penalty increases the diversity of the generated molecules. In this work, we investigated the reproducibility of their claims. Overall, we were able to reproduce comparable results using the SELFIES-based GA, but mostly by exploiting deficiencies of the (easily optimizable) fitness function (i.e., generating long, sulfur containing chains). In addition, we reproduce results showing that the discriminator can be used to bias the generation of molecules to ones that are similar to the reference set. Lastly, we attempted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Metabolomics and Mass Spectrometry Studies
MethodsGenetic Algorithms
