Kernel-Elastic Autoencoder for Molecular Design
Haote Li, Yu Shee, Brandon Allen, Federica Maschietto, Victor Batista

TL;DR
The Kernel-Elastic Autoencoder (KAE) is a transformer-based generative model that achieves high diversity and accuracy in molecular design, enabling conditional generation and outperforming existing models in various benchmarks.
Contribution
KAE introduces two novel loss functions and demonstrates state-of-the-art performance in molecular generation, reconstruction, and conditional design tasks.
Findings
KAE achieves near-perfect reconstruction and high diversity in molecule generation.
KAE outperforms previous models in constrained optimization and docking affinity predictions.
KAE enables conditional molecule generation based on binding affinity scores.
Abstract
We introduce the Kernel-Elastic Autoencoder (KAE), a self-supervised generative model based on the transformer architecture with enhanced performance for molecular design. KAE is formulated based on two novel loss functions: modified maximum mean discrepancy and weighted reconstruction. KAE addresses the long-standing challenge of achieving valid generation and accurate reconstruction at the same time. KAE achieves remarkable diversity in molecule generation while maintaining near-perfect reconstructions on the independent testing dataset, surpassing previous molecule-generating models. KAE enables conditional generation and allows for decoding based on beam search resulting in state-of-the-art performance in constrained optimizations. Furthermore, KAE can generate molecules conditional to favorable binding affinities in docking applications as confirmed by AutoDock Vina and Glide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Chemical Synthesis and Analysis
MethodsGuided Language to Image Diffusion for Generation and Editing
