Toward Better Geometric Representations for Molecule Generative Models

Shaoheng Yan; Zian Li; Cai Zhou; Qiaojing Huang; Kai Liu; Muhan Zhang

arXiv:2605.07693·cs.LG·May 11, 2026

Toward Better Geometric Representations for Molecule Generative Models

Shaoheng Yan, Zian Li, Cai Zhou, Qiaojing Huang, Kai Liu, Muhan Zhang

PDF

TL;DR

This paper introduces LENSEs, a framework that enhances geometric molecule representations for generative models by aligning and refining pretrained encoder features, leading to improved validity and stability in molecule generation.

Contribution

LENSEs employs multi-level representation extraction, perceptual loss, and node-level alignment to better exploit pretrained molecular encoders in generative tasks.

Findings

01

Achieves 97.28% validity and 98.51% stability on GEOM-DRUG dataset.

02

Reduces Lipschitz constant by 4.6x, indicating smoother representations.

03

Demonstrates improved representations through QM9 probing tasks.

Abstract

Geometric representation-conditioned molecule generation provides an effective paradigm that decouples molecule representation modeling from structure generation. By decoupling molecule generation into two stages-first generating a meaningful molecule representation, and then generating a 3D molecule conditioned on this representation-the efficiency and quality of the generation process can be significantly enhanced. However, its effectiveness is fundamentally limited by the quality of the representation space: pretrained molecular encoders, such as UniMol, produce representations that are non-smooth and not fully exploited during the generative training process. In this work, we propose LENSEs, a framework that better exploits the potential of molecule representations in representation-conditioned generation methods. In particular, LENSEs introduces three complementary mechanisms: (1)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.