Metatensor and metatomic: foundational libraries for interoperable atomistic machine learning

Filippo Bigi; Joseph W. Abbott; Philip Loche; Arslan Mazitov; Davide Tisi; Marcel F. Langer; Alexander Goscinski; Paolo Pegolo; Sanggyu Chong; Rohit Goswami; Pol Febrer; Sofiia Chorna; Matthias Kellner; Michele Ceriotti; Guillaume Fraux

arXiv:2508.15704·physics.chem-ph·March 9, 2026

Metatensor and metatomic: foundational libraries for interoperable atomistic machine learning

Filippo Bigi, Joseph W. Abbott, Philip Loche, Arslan Mazitov, Davide Tisi, Marcel F. Langer, Alexander Goscinski, Paolo Pegolo, Sanggyu Chong, Rohit Goswami, Pol Febrer, Sofiia Chorna, Matthias Kellner, Michele Ceriotti, Guillaume Fraux

PDF

TL;DR

This paper introduces two software libraries, metatensor and metatomic, designed to enable seamless data sharing, model management, and interoperability between atomistic machine learning frameworks and traditional simulation tools, addressing key integration challenges.

Contribution

The paper presents novel libraries that facilitate interoperability and data management for atomistic ML, bridging the gap between ML frameworks and simulation software.

Findings

01

Metatensor enables multi-platform, multi-language array manipulation with geometric metadata.

02

Metatomic provides portable storage for atomistic ML models and metadata.

03

The libraries improve integration and data sharing between ML and atomistic simulation tools.

Abstract

Incorporation of machine learning (ML) techniques into atomic-scale modeling has proven to be an extremely effective strategy to improve the accuracy and reduce the computational cost of simulations. It also entails conceptual and practical challenges, as it involves combining very different mathematical foundations, as well as software ecosystems that are very well developed in their own right, but do not share many commonalities. To address these issues and facilitate the adoption of ML in atomistic simulations, we introduce two dedicated software libraries. The first one, metatensor, provides multi-platform and multi-language storage and manipulation of arrays with many potentially sparse indices, designed from the ground up for atomistic ML applications. By combining the actual values with metadata that describes their nature and that facilitates the handling of geometric…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.