Reinforcement Fine-Tuning for Materials Design
Zhendong Cao, Lei Wang

TL;DR
This paper introduces CrystalFormer-RL, a reinforcement fine-tuning approach that improves crystal generation and property optimization, enabling more stable and desirable materials discovery using machine learning.
Contribution
It presents a novel reinforcement fine-tuning method for generative models in materials design, integrating discriminative model rewards to enhance crystal quality and property control.
Findings
Enhanced stability in generated crystals.
Successful discovery of crystals with conflicting properties.
Reinforcement fine-tuning enables property-guided design.
Abstract
Reinforcement fine-tuning played an instrumental role in enhancing the instruction-following and reasoning abilities of large language models. In this work, we employ reinforcement fine-tuning for materials design, in which discriminative machine learning models are used to provide rewards to the autoregressive transformer-based materials generative model CrystalFormer. By optimizing the reward signals-such as energy above the convex hull and material properties figures of merit-reinforcement fine-tuning infuses knowledge from discriminative models into generative models. The resulting model, CrystalFormer-RL, shows enhanced stability in generated crystals and successfully discovers crystals with desirable yet conflicting material properties, such as substantial dielectric constant and band gap simultaneously. Notably, we observe that reinforcement fine-tuning not only enables the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced ceramic materials synthesis · Material Properties and Applications · Material Selection and Properties
