FAIR AI Models in High Energy Physics
Javier Duarte, Haoyang Li, Avik Roy, Ruike Zhu, E. A., Huerta, Daniel Diaz, Philip Harris, Raghav Kansal, Daniel S. Katz, and Ishaan H. Kavoori, Volodymyr V. Kindratenko, Farouk Mokhtar and, Mark S. Neubauer, Sang Eon Park, Melissa Quinnan, Roger Rusack and, Zhizhen Zhao

TL;DR
This paper proposes a practical framework for applying FAIR principles to AI models in high energy physics, demonstrating its use with a graph neural network for Higgs boson identification.
Contribution
It introduces a definition and template for FAIR AI models in HEP, enhancing model robustness, portability, and interpretability.
Findings
The FAIR AI model is robust across different hardware and software environments.
The model demonstrates high interpretability in identifying Higgs bosons.
Application of the template improves sharing and reuse of AI models in HEP.
Abstract
The findable, accessible, interoperable, and reusable (FAIR) data principles provide a framework for examining, evaluating, and improving how data is shared to facilitate scientific discovery. Generalizing these principles to research software and other digital products is an active area of research. Machine learning (ML) models -- algorithms that have been trained on data without being explicitly programmed -- and more generally, artificial intelligence (AI) models, are an important target for this because of the ever-increasing pace with which AI is transforming scientific domains, such as experimental high energy physics (HEP). In this paper, we propose a practical definition of FAIR principles for AI models in HEP and describe a template for the application of these principles. We demonstrate the template's use with an example AI model applied to HEP, in which a graph neural network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Distributed and Parallel Computing Systems · Research Data Management Practices
MethodsGraph Neural Network
