MatRIS: Toward Reliable and Efficient Pretrained Machine Learning Interatomic Potentials
Yuanchang Zhou, Siyu Hu, Xiangyu Zhang, Hongyu Wang, Guangming Tan, Weile Jia

TL;DR
MatRIS introduces an attention-based invariant machine learning interatomic potential that achieves high accuracy and scalability, matching or surpassing equivariant models while reducing computational costs in materials science applications.
Contribution
The paper presents MatRIS, a novel invariant MLIP with a linear complexity attention mechanism, enabling scalable and accurate modeling of atomic interactions.
Findings
MatRIS achieves comparable accuracy to state-of-the-art equivariant models.
MatRIS demonstrates lower training costs while maintaining high performance.
The model scales linearly with the number of atoms, enabling large-scale applications.
Abstract
Foundation MLIPs demonstrate broad applicability across diverse material systems and have emerged as a powerful and transformative paradigm in chemical and computational materials science. Equivariant MLIPs achieve state-of-the-art accuracy in a wide range of benchmarks by incorporating equivariant inductive bias. However, the reliance on tensor products and high-degree representations makes them computationally costly. This raises a fundamental question: as quantum mechanical-based datasets continue to expand, can we develop a more compact model to thoroughly exploit high-dimensional atomic interactions? In this work, we present MatRIS (\textbf{Mat}erials \textbf{R}epresentation and \textbf{I}nteraction \textbf{S}imulation), an invariant MLIP that introduces attention-based modeling of three-body interactions. MatRIS leverages a novel separable attention mechanism with linear…
Peer Reviews
Decision·ICLR 2026 Poster
1. The architecture is new. The authors introduce a new architectural approach by explicitly modeling three-body interactions using a line graph attention mechanism. Each edge in the atom graph becomes a node in the line graph, enabling the model to attend over bond angles. This is claimed to be the first use of attention for three-body interactions in ML potentials, extending beyond traditional message-passing that usually considers only pairwise interactions. 2. They employ a separable attenti
1. The evaluation on large-scale datasets is limited. A notable gap in the experiments is the absence of results on the latest massive and diverse datasets, specifically Open Materials 2024 (OMat 24) and Open Molecules 2025 (OMol 25). If the model is claimed to be scalable, then it should perform well on these datasets. At least a benchmark on OMol 4M would be nice to have. 2. Despite the terminology, MatRIS’s mechanism differs from standard scaled dot-product self-attention with Q/K/V projectio
The model, MatRIS, achieves state-of-the-art efficiency of the compliant track of Matbench Discovery with the F1 score of 0.844, and performs equally well on other benchmarks - MatCalc, Molecular Zero Shot Benchmark, and DPA2. The paper provides several ablations on the model design and fitting procedure. The computational efficiency of the MatRIS model is competitive in both training and inference.
The position of the available MLIPs architectures into invariant and equivariant groups is not correct. The paper groups all the models into only two groups, invariant and equivariant, missing the third big group of models - rotationally unconstrained architectures, and assigns many of the rotationally unconstrained architectures into the invariant groups. Because of this, some claims are not entirely true, for instance, "Recent studies (Neumann et al., 2024; Qu & Krishnapriyan, 2024; Rhodes et
1. Using self-attention in the line graph to encode three-body information is novel?, sound, and simple. 2. The separable attention is also with reasonable motivation. 3. The method performs well in several benchmarks. 4. The experiments are sufficient.
1. Although I assume the methodology is novel, some evidences need to be given with respect to the claim in L94-95: ``$\cdots$, our model is the first to leverage attention mechanisms for modeling three-body interaction''. Specifically, the authors should justify its differences to VisNet [1], FreeCG [2], and MGT [3], which all adopted self-attention mechanisms and captured up to four-body interactions. Is that the case that your model uses self-attention to encode the many-body interactions whi
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Quantum many-body systems · Advanced Graph Neural Networks
