Physics-Learning AI Datamodel (PLAID) datasets: a collection of physics simulations for machine learning
Fabien Casenave, Xavier Roynard, Brian Staber, William Piat, Michele, Alessandro Bucci, Nissrine Akkari, Abbas Kabalan, Xuan Minh Vuong Nguyen,, Luca Saverio, Rapha\"el Carpintero Perez, Anthony Kalaydjian, Samy Fouch\'e,, Thierry Gonon, Ghassan Najjar, Emmanuel Menier

TL;DR
PLAID introduces a standardized, extensible framework and datasets for physics simulations to facilitate machine learning applications across diverse physical domains.
Contribution
It provides a unified data model, a library for dataset handling, and six benchmark datasets covering mechanics and fluid dynamics, promoting standardization and community engagement.
Findings
Six curated physics simulation datasets released under PLAID standard
Baseline benchmarks established for various machine learning methods
Tools available for community participation and ongoing evaluation
Abstract
Machine learning-based surrogate models have emerged as a powerful tool to accelerate simulation-driven scientific workflows. However, their widespread adoption is hindered by the lack of large-scale, diverse, and standardized datasets tailored to physics-based simulations. While existing initiatives provide valuable contributions, many are limited in scope-focusing on specific physics domains, relying on fragmented tooling, or adhering to overly simplistic datamodels that restrict generalization. To address these limitations, we introduce PLAID (Physics-Learning AI Datamodel), a flexible and extensible framework for representing and sharing datasets of physics simulations. PLAID defines a unified standard for describing simulation data and is accompanied by a library for creating, reading, and manipulating complex datasets across a wide range of physical use cases…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Machine Learning in Materials Science · Model Reduction and Neural Networks
MethodsLib
