LAMBench: A Benchmark for Large Atomistic Models
Anyang Peng, Chun Cai, Mingyu Guo, Duo Zhang, Chengqian Zhang, Wanrun Jiang, Yinan Wang, Antoine Loew, Chengkun Wu, Weinan E, Linfeng Zhang, Han Wang

TL;DR
LAMBench is a new benchmarking system that evaluates large atomistic models (LAMs) on their generalizability and applicability, revealing gaps in current models and guiding future improvements for scientific discovery.
Contribution
This paper introduces LAMBench, the first comprehensive benchmark for assessing the performance and universality of large atomistic models across diverse scientific contexts.
Findings
Current LAMs show significant gaps compared to the ideal universal potential energy surface.
Incorporating cross-domain training data improves model generalizability.
Supporting multi-fidelity modeling and ensuring model conservativeness enhances robustness.
Abstract
Large Atomistic Models (LAMs) have undergone remarkable progress recently, emerging as universal or fundamental representations of the potential energy surface defined by the first-principles calculations of atomistic systems. However, our understanding of the extent to which these models achieve true universality, as well as their comparative performance across different models, remains limited. This gap is largely due to the lack of comprehensive benchmarks capable of evaluating the effectiveness of LAMs as approximations to the universal potential energy surface. In this study, we introduce LAMBench, a benchmarking system designed to evaluate LAMs in terms of their generalizability, adaptability, and applicability. These attributes are crucial for deploying LAMs as ready-to-use tools across a diverse array of scientific discovery contexts. We benchmark ten state-of-the-art LAMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science
