Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models
Gabriel Lima Guimaraes, Benjamin Sanchez-Lengeling, Carlos Outeiral,, Pedro Luis Cunha Farias, Al\'an Aspuru-Guzik

TL;DR
This paper introduces ORGAN, a novel model combining GANs and reinforcement learning to steer sequence generation towards specific metrics while retaining learned data characteristics, demonstrated in molecule and music generation.
Contribution
The paper presents ORGAN, a new approach that integrates GANs with reinforcement learning to control sequence generation based on desired metrics.
Findings
Effective biasing of sequence generation towards target metrics.
Successful application to molecule (SMILES) and music data.
Maintains data fidelity while steering outputs.
Abstract
In unsupervised data generation tasks, besides the generation of a sample based on previous observations, one would often like to give hints to the model in order to bias the generation towards desirable metrics. We propose a method that combines Generative Adversarial Networks (GANs) and reinforcement learning (RL) in order to accomplish exactly that. While RL biases the data generation process towards arbitrary metrics, the GAN component of the reward function ensures that the model still remembers information learned from data. We build upon previous results that incorporated GANs and RL in order to generate sequence data and test this model in several settings for the generation of molecules encoded as text sequences (SMILES) and in the context of music generation, showing for each case that we can effectively bias the generation process towards desired metrics.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Materials Science · Natural Language Processing Techniques
MethodsConvolution · Dogecoin Customer Service Number +1-833-534-1729
