TL;DR
ADSeqGAN is a novel GAN-based framework that enhances molecular generation from small datasets by integrating an auxiliary classifier, pretrained components, and Wasserstein distance, leading to improved quality and diversity in drug discovery applications.
Contribution
This work introduces ADSeqGAN, the first GAN model with an auxiliary classifier and pretrained components specifically designed for small-sample molecular generation.
Findings
Outperforms baseline models in nucleic acid binder generation
Significantly improves CNS drug oversampling results
Generates novel CB1 ligands with high predicted activity
Abstract
In this work, we introduce Auxiliary Discriminator Sequence Generative Adversarial Networks (ADSeqGAN), a novel approach for molecular generation in small-sample datasets. Traditional generative models often struggle with limited training data, particularly in drug discovery, where molecular datasets for specific therapeutic targets, such as nucleic acids binders and central nervous system (CNS) drugs, are scarce. ADSeqGAN addresses this challenge by integrating an auxiliary random forest classifier as an additional discriminator into the GAN framework, significantly improves molecular generation quality and class specificity. Our method incorporates pretrained generator and Wasserstein distance to enhance training stability and diversity. We evaluate ADSeqGAN across three representative cases. First, on nucleic acid- and protein-targeting molecules, ADSeqGAN shows superior capability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
