Toward Better Geometric Representations for Molecule Generative Models
Shaoheng Yan, Zian Li, Cai Zhou, Qiaojing Huang, Kai Liu, Muhan Zhang

TL;DR
This paper introduces LENSEs, a framework that enhances geometric molecule representations for generative models by aligning and refining pretrained encoder features, leading to improved validity and stability in molecule generation.
Contribution
LENSEs employs multi-level representation extraction, perceptual loss, and node-level alignment to better exploit pretrained molecular encoders in generative tasks.
Findings
Achieves 97.28% validity and 98.51% stability on GEOM-DRUG dataset.
Reduces Lipschitz constant by 4.6x, indicating smoother representations.
Demonstrates improved representations through QM9 probing tasks.
Abstract
Geometric representation-conditioned molecule generation provides an effective paradigm that decouples molecule representation modeling from structure generation. By decoupling molecule generation into two stages-first generating a meaningful molecule representation, and then generating a 3D molecule conditioned on this representation-the efficiency and quality of the generation process can be significantly enhanced. However, its effectiveness is fundamentally limited by the quality of the representation space: pretrained molecular encoders, such as UniMol, produce representations that are non-smooth and not fully exploited during the generative training process. In this work, we propose LENSEs, a framework that better exploits the potential of molecule representations in representation-conditioned generation methods. In particular, LENSEs introduces three complementary mechanisms: (1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
