Cell Morphology-Guided Small Molecule Generation with GFlowNets
Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso, Biancalani, Yoshua Bengio, Gabriele Scalia, Micha{\l} Koziarski

TL;DR
This paper introduces a novel method for HCI-guided molecular design using GFlowNets and unsupervised multimodal embeddings, enabling the generation of molecules with phenotypic effects similar to target images without extensive labeled data.
Contribution
It proposes an innovative approach combining GFlowNets with unsupervised multimodal embeddings for phenotype-guided molecule generation, bypassing the need for labeled phenotypic data.
Findings
Generated molecules show high morphological similarity to targets
The method increases likelihood of similar biological activity
Effective in low-data, high-dimensional phenotypic spaces
Abstract
High-content phenotypic screening, including high-content imaging (HCI), has gained popularity in the last few years for its ability to characterize novel therapeutics without prior knowledge of the protein target. When combined with deep learning techniques to predict and represent molecular-phenotype interactions, these advancements hold the potential to significantly accelerate and enhance drug discovery applications. This work focuses on the novel task of HCI-guided molecular design. Generative models for molecule design could be guided by HCI data, for example with a supervised model that links molecules to phenotypes of interest as a reward function. However, limited labeled data, combined with the high-dimensional readouts, can make training these methods challenging and impractical. We consider an alternative approach in which we leverage an unsupervised multimodal joint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Genetics, Bioinformatics, and Biomedical Research
