Towards a Common Format for Computational Material Science Data
Luca M. Ghiringhelli (1), Christian Carbogno (1), Sergey Levchenko, (1), Fawzi Mohamed (1), Georg Huhs (2,3), Martin Lueders (4), Micael Oliveira, (5,6), Matthias Scheffler (1,7) ((1) Fritz Haber Institute of the Max, Planck Society, Berlin

TL;DR
This paper discusses the development of a standardized, code-independent data format for computational materials science, emphasizing the complementary strategies of converters and open libraries to facilitate data exchange.
Contribution
It introduces a unified format and conventions for electronic-structure data, developed collaboratively by ESL and NOMAD, promoting standardization in the field.
Findings
Agreed upon a common data format and conventions.
Demonstrated the complementarity of converters and open libraries.
Defined hierarchical metadata for electronic-structure calculations.
Abstract
Information and data exchange is an important aspect of scientific progress. In computational materials science, a prerequisite for smooth data exchange is standardization, which means using agreed conventions for, e.g., units, zero base lines, and file formats. There are two main strategies to achieve this goal. One accepts the heterogeneous nature of the community which comprises scientists from physics, chemistry, bio-physics, and materials science, by complying with the diverse ecosystem of computer codes and thus develops "converters" for the input and output files of all important codes. These converters then translate the data of all important codes into a standardized, code-independent format. The other strategy is to provide standardized open libraries that code developers can adopt for shaping their inputs, outputs, and restart files, directly into the same code-independent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Machine Learning in Materials Science · Advanced Data Storage Technologies
