QSAR-Guided Generative Framework for the Discovery of Synthetically Viable Odorants
Tim C. Pearce, Ahmed Ibrahim

TL;DR
This paper introduces a QSAR-guided VAE framework that efficiently generates novel, syntactically valid odorant molecules with high structural diversity from limited training data, advancing de novo molecular design for fragrances.
Contribution
It combines a VAE with QSAR modeling to generate odorants from small datasets, enabling extensive chemical space exploration and structural novelty.
Findings
100% validity of generated molecules confirmed
94.8% of generated molecules are unique
74.4% of candidates have novel core frameworks
Abstract
The discovery of novel odorant molecules is key for the fragrance and flavor industries, yet efficiently navigating the vast chemical space to identify structures with desirable olfactory properties remains a significant challenge. Generative artificial intelligence offers a promising approach for \textit{de novo} molecular design but typically requires large sets of molecules to learn from. To address this problem, we present a framework combining a variational autoencoder (VAE) with a quantitative structure-activity relationship (QSAR) model to generate novel odorants from limited training sets of odor molecules. The self-supervised learning capabilities of the VAE allow it to learn SMILES grammar from ChemBL database, while its training objective is augmented with a loss term derived from an external QSAR model to structure the latent representation according to odor probability.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOlfactory and Sensory Function Studies · Computational Drug Discovery Methods · Advanced Chemical Sensor Technologies
