aims-PAX: Parallel Active eXploration for the automated construction of Machine Learning Force Fields
Tobias Henkes, Shubham Sharma, Alexandre Tkatchenko, Mariana Rossi, Igor Poltavskyi

TL;DR
aims-PAX is a versatile, high-performance active learning framework that accelerates the development of accurate machine learning force fields for complex molecular and material systems, reducing computational costs significantly.
Contribution
It introduces a modular, scalable active learning workflow integrated with ab initio codes, enabling rapid and reliable MLFF development across diverse systems.
Findings
Achieves up to 1000-fold reduction in reference calculations.
Automatically identifies challenging chemical systems.
Enables efficient simulation of large solvated molecules.
Abstract
Recent advances in machine learning force fields (MLFF) have significantly extended the reach of atomistic simulations. Continuous progress in this field requires reliable reference datasets, accurate MLFF architectures, and efficient active learning strategies to enable robust modeling of complex molecular and material systems. Here we introduce aims-PAX, an expedited, multi-trajectory active learning framework that streamlines the development of stable and accurate MLFFs. Designed for a wide range of researchers, aims-PAX offers a modular, high-performance workflow that couples diversified sampling with scalable training across CPU and GPU architectures. Integrated with the widely used ab initio code FHI-aims, the framework supports state-of-the-art ML models and dataset generation using general-purpose (or "foundational") force-fields for rapid deployment in diverse systems. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical and Computational Modeling · Machine Learning and Data Classification · Mathematics, Computing, and Information Processing
