Stiefel Flow Matching for Moment-Constrained Structure Elucidation
Austin Cheng, Alston Lo, Kin Long Kelvin Lee, Santiago Miret, Al\'an, Aspuru-Guzik

TL;DR
This paper introduces Stiefel Flow Matching, a novel generative model that accurately predicts 3D molecular structures from moments of inertia by leveraging the geometry of the Stiefel manifold, outperforming existing methods.
Contribution
The paper formulates the structure elucidation problem on the Stiefel manifold and develops a flow matching approach that enforces exact moment constraints for improved molecular structure prediction.
Findings
Achieves higher success rates in structure prediction.
Enables faster sampling compared to Euclidean diffusion models.
Effectively handles high-dimensional molecular data.
Abstract
Molecular structure elucidation is a fundamental step in understanding chemical phenomena, with applications in identifying molecules in natural products, lab syntheses, forensic samples, and the interstellar medium. We consider the task of predicting a molecule's all-atom 3D structure given only its molecular formula and moments of inertia, motivated by the ability of rotational spectroscopy to measure these moments. While existing generative models can conditionally sample 3D structures with approximately correct moments, this soft conditioning fails to leverage the many digits of precision afforded by experimental rotational spectroscopy. To address this, we first show that the space of -atom point clouds with a fixed set of moments of inertia is embedded in the Stiefel manifold . We then propose Stiefel Flow Matching as a generative model for elucidating 3D…
Peer Reviews
Decision·ICLR 2025 Poster
- As far as I can tell, the approach used in the paper of embedding the molecular structure elucidation task in the Stiefel manifold is original. This approach leverages manifold geometry to respect exact moment constraints, improving over traditional Euclidean diffusion models. - In general, the paper is written and formatted very well. I appreciate the consistent and well-written formalisms and figures, making the otherwise quite abstract paper well-readable. Moreover, the formalisms support
- Most obviously, for larger datasets like GEOM, the model struggles with validity and stability of generated structures, indicating possible underfitting issues. I think some more concrete suggestions on how to improve these issues would be greatly beneficial. - Some connections to recent approaches to molecular generation with discrete flow matching are missing in the paper. Even though these problems consider a slightly different task as they operate on discrete domains, some intuition of why
The paper shows the effectiveness of the proposed approach through evaluations over two molecule benchmarks: QM9 and GEOM. The method achieved lower RMSD for the generated molecules with a lower number of function evaluations (NFE) compared to the Euclidean diffusion model.
Limited model comparisons: The paper only compares its results against one method in the literature, where it might consider other works on Riemannian generative models.
- The paper is generally well-written and easy to follow. The math is sound and clearly presented. - The use of Stiefel manifold and its connection to the problem and the constraints in (2). - The use of geodesics in the Stiefel manifold that provides straight lines for navigating the manifold.
- How generalizable is the proposed method? Can the authors perform OOD evaluation? - Why the need to generate K=10 samples and report only the one with the lowest RMSD? Is this the standard practice? if yes, support is needed. If not, then performing this indicates instability which requires further investigation. - The evaluation metric for the proposed method seems to depends on RMSD thresholds defined by the authors. Have these thresholds been employed by previous methods that utilize ne
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks
MethodsSparse Evolutionary Training · Diffusion
