Attention Based Molecule Generation via Hierarchical Variational Autoencoder
Divahar Sivanesan

TL;DR
This paper introduces a hierarchical neural network model combining RNNs and CNNs for molecule generation, achieving high validity and effective mapping between molecules and their representations.
Contribution
It proposes a novel hierarchical architecture that improves molecule generation by capturing long-range dependencies and reducing invalid outputs.
Findings
95% validity rate in molecule reconstruction
Average Tanimoto similarity of 0.6 between test and reconstructed molecules
Enhanced mapping between SMILES strings and learned representations
Abstract
Molecule generation is a task made very difficult by the complex ways in which we represent molecules computationally. A common technique used in molecular generative modeling is to use SMILES strings with recurrent neural networks built into variational autoencoders - but these suffer from a myriad of issues: vanishing gradients, long-range forgetting, and invalid molecules. In this work, we show that by combining recurrent neural networks with convolutional networks in a hierarchical manner, we are able to both extract autoregressive information from SMILES strings while maintaining signal and long-range dependencies. This allows for generations with very high validity rates on the order of 95% when reconstructing known molecules. We also observe an average Tanimoto similarity of .6 between test set and reconstructed molecules, which suggests our method is able to map between SMILES…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Image Processing Techniques and Applications
MethodsSparse Evolutionary Training
