Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees
Alexia Jolicoeur-Martineau, Aristide Baratin, Kisoo Kwon, Boris Knyazev, Yan Zhang

TL;DR
This paper introduces STGG+, an advanced molecule generation method that uses spanning trees and Transformer architecture to generate valid molecules conditioned on multiple properties, outperforming previous models in quality and flexibility.
Contribution
STGG+ extends spanning tree-based graph generation to multi-property conditional generation with self-criticism, Transformer integration, and classifier-free guidance, achieving state-of-the-art results.
Findings
Outperforms existing models in conditional molecule generation
Effective in out-of-distribution scenarios
Enables flexible property conditioning
Abstract
Generating novel molecules is challenging, with most representations leading to generative models producing many invalid molecules. Spanning Tree-based Graph Generation (STGG) is a promising approach to ensure the generation of valid molecules, outperforming state-of-the-art SMILES and graph diffusion models for unconditional generation. In the real world, we want to be able to generate molecules conditional on one or multiple desired properties rather than unconditionally. Thus, in this work, we extend STGG to multi-property-conditional generation. Our approach, STGG+, incorporates a modern Transformer architecture, random masking of properties during training (enabling conditioning on any subset of properties and classifier-free guidance), an auxiliary property-prediction loss (allowing the model to self-criticize molecules and select the best ones), and other improvements. We show…
Peer Reviews
Decision·Submitted to ICLR 2025
The paper presents molecule generation models, allowing multi-property control and self-assessment of generated molecules. It’s well-designed, with detailed experiments showing strong results across different datasets. The writing is clear and structured.
The self-criticism mechanism for filtering generated molecules based on property predictions is a key feature, but there is limited evaluation of its accuracy. Detailed analysis will be necessary. I am not an expert of this field so I will lower my confidence score.
1. To my knowledge, this manuscript is the first to thoroughly examine STGG for reward conditioning and/or optimization. 2. The work reflects a substantial effort to assess STGG+'s capabilities. Overall, the approach appears methodologically sound. 3. Molecular property optimization is an open challenge. Given its competitive performance compared to existing algorithms and its use of a (somewhat) unique molecular representation, I expect this work will attract reasonable interest.
1. The primary limitation of this paper is that generating 'valid' molecules does not guarantee synthesizability. Many molecules presented in the appendix would be very challenging, if not impossible, to synthesize. Meanwhile, some baseline methods may perform slightly worse on reward but produce molecules that are easier to synthesize, avoiding "reward hacking." A fairer comparison would involve evaluating the reward optimization performance of synthesizable molecules across different algorithm
- This paper is presented with sufficiently clear descriptions. - The authors explored a wide range of techniques that can be applied in the under-explored context of multi-property conditional generation.
- It seems to me that the authors invented complicated ad-hoc designs and specifically engineered to fix any issues that may arise, for example by masking the creation of rings when reaching max number (100), or alternating the use of CFG and ranking via a property-predictor. I'm afraid this hampers the overall generality of the proposed method. - Ablation studies are missing. What's the effect of the improved Transformer architecture against the vanilla one? How does the auxiliary property pred
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolecular Junctions and Nanostructures · Advanced biosensing and bioanalysis techniques · Monoclonal and Polyclonal Antibodies Research
MethodsAttention Is All You Need · Byte Pair Encoding · Layer Normalization · Linear Layer · Label Smoothing · Diffusion · Adam · Dropout · Multi-Head Attention · Dense Connections
