A COLD Approach to Generating Optimal Samples
Omar Mahmood, Jos\'e Miguel Hern\'andez-Lobato

TL;DR
COLD is a novel constrained global optimisation method that generates high-quality, diverse samples with desired properties, maintaining similarity to training data across image and molecular datasets.
Contribution
The paper introduces COLD, a new constrained optimisation approach that improves sample quality and diversity while ensuring similarity to training data in discrete data generation.
Findings
COLD finds optimal samples for multiple objectives on MNIST.
Tighter constraints improve sample quality and diversity.
Generates diverse drug-like molecules with high scores on ChEMBL.
Abstract
Optimising discrete data for a desired characteristic using gradient-based methods involves projecting the data into a continuous latent space and carrying out optimisation in this space. Carrying out global optimisation is difficult as optimisers are likely to follow gradients into regions of the latent space that the model has not been exposed to during training; samples generated from these regions are likely to be too dissimilar to the training data to be useful. We propose Constrained Optimisation with Latent Distributions (COLD), a constrained global optimisation procedure to find samples with high values of a desired property that are similar to yet distinct from the training data. We find that on MNIST, our procedure yields optima for each of three different objectives, and that enforcing tighter constraints improves the quality and increases the diversity of the generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Machine Learning and Data Classification
