Beyond Adam: Disentangling Optimizer Effects in the Fine-Tuning of Atomistic Foundation Models
Xiaoqing Liu, Yangshuai Wang, Teng Zhao

TL;DR
This paper benchmarks seven optimizers for fine-tuning atomistic models, revealing how optimizer choice affects accuracy, stability, and physical property predictions, and introduces a preconditioning framework for understanding optimizer effects.
Contribution
It provides a comprehensive benchmark and analysis of optimizer effects on atomistic model fine-tuning, introducing a preconditioning perspective to interpret optimizer behavior.
Findings
AdamW and ScheduleFree outperform others in curvature conditioning.
SGD shows slow convergence and instability.
Second-order refinement improves physical property accuracy.
Abstract
Atomistic foundation models constitute a paradigm shift in computational materials science by providing universal machine-learned interatomic potentials with broad transferability across chemical spaces. Although fine-tuning is essential for adapting these pretrained models to specific target systems, the influence of the optimization algorithm on this process remains insufficiently characterized. In this work, we perform a rigorous benchmark of seven first-order optimizers, including Adam, AdamW, RAdam, SGD, LAMB, Ranger, and ScheduleFree, for the fine-tuning of foundation models across molecular, crystalline, and liquid regimes. We evaluate these algorithms based on energy and force accuracy for both in-distribution and out-of-distribution configurations, as well as their impact on downstream physical properties such as elastic moduli, phonon spectra, and interfacial dynamics. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Block Copolymer Self-Assembly · Model Reduction and Neural Networks
