LAMBench: A Benchmark for Large Atomistic Models

Anyang Peng; Chun Cai; Mingyu Guo; Duo Zhang; Chengqian Zhang; Wanrun Jiang; Yinan Wang; Antoine Loew; Chengkun Wu; Weinan E; Linfeng Zhang; Han Wang

arXiv:2504.19578·physics.comp-ph·August 19, 2025·2 cites

LAMBench: A Benchmark for Large Atomistic Models

Anyang Peng, Chun Cai, Mingyu Guo, Duo Zhang, Chengqian Zhang, Wanrun Jiang, Yinan Wang, Antoine Loew, Chengkun Wu, Weinan E, Linfeng Zhang, Han Wang

PDF

Open Access 1 Repo

TL;DR

LAMBench is a new benchmarking system that evaluates large atomistic models (LAMs) on their generalizability and applicability, revealing gaps in current models and guiding future improvements for scientific discovery.

Contribution

This paper introduces LAMBench, the first comprehensive benchmark for assessing the performance and universality of large atomistic models across diverse scientific contexts.

Findings

01

Current LAMs show significant gaps compared to the ideal universal potential energy surface.

02

Incorporating cross-domain training data improves model generalizability.

03

Supporting multi-fidelity modeling and ensuring model conservativeness enhances robustness.

Abstract

Large Atomistic Models (LAMs) have undergone remarkable progress recently, emerging as universal or fundamental representations of the potential energy surface defined by the first-principles calculations of atomistic systems. However, our understanding of the extent to which these models achieve true universality, as well as their comparative performance across different models, remains limited. This gap is largely due to the lack of comprehensive benchmarks capable of evaluating the effectiveness of LAMs as approximations to the universal potential energy surface. In this study, we introduce LAMBench, a benchmarking system designed to evaluate LAMs in terms of their generalizability, adaptability, and applicability. These attributes are crucial for deploying LAMs as ready-to-use tools across a diverse array of scientific discovery contexts. We benchmark ten state-of-the-art LAMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deepmodeling/lambench
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science