Open Materials Generation with Inference-Time Reinforcement Learning
Philipp Hoellmer, Stefano Martiniani

TL;DR
This paper introduces OMatG-IRL, a reinforcement learning framework that enhances continuous-time generative models for crystalline materials by enabling property-driven crystal structure prediction without explicit score computation.
Contribution
OMatG-IRL is the first RL application to crystal structure prediction, operating directly on velocity fields, improving sampling efficiency, and maintaining diversity in generative models.
Findings
Effective reinforcement of energy-based objectives in CSP.
Order-of-magnitude improvements in sampling efficiency.
Preservation of diversity through composition conditioning.
Abstract
Continuous-time generative models for crystalline materials enable inverse materials design by learning to predict stable crystal structures, but incorporating explicit target properties into the generative process remains challenging. Policy-gradient reinforcement learning (RL) provides a principled mechanism for aligning generative models with downstream objectives but typically requires access to the score, which has prevented its application to flow-based models that learn only velocity fields. We introduce Open Materials Generation with Inference-time Reinforcement Learning (OMatG-IRL), a policy-gradient RL framework that operates directly on the learned velocity fields and eliminates the need for the explicit computation of the score. OMatG-IRL leverages stochastic perturbations of the underlying generation dynamics preserving the baseline performance of the pretrained generative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis
