Shared Data and Algorithms for Deep Learning in Fundamental Physics
Lisa Benato, Erik Buhmann, Martin Erdmann, Peter Fackeldey, Jonas, Glombitza, Nikolai Hartmann, Gregor Kasieczka, William Korcari, Thomas Kuhr,, Jan Steinheimer, Horst St\"ocker, Tilman Plehn, Kai Zhou

TL;DR
This paper presents a unified Python package offering access to diverse fundamental physics datasets for machine learning, along with a versatile graph neural network approach that achieves near state-of-the-art performance across tasks.
Contribution
The authors introduce a unified data interface and reference models for fundamental physics datasets, facilitating cross-disciplinary machine learning and transfer learning research.
Findings
Achieves performance close to specialized methods on all datasets.
Provides a flexible graph neural network architecture adaptable to various physics problems.
Simplifies data handling and model implementation for fundamental physics machine learning.
Abstract
We introduce a Python package that provides simply and unified access to a collection of datasets from fundamental physics research - including particle physics, astroparticle physics, and hadron- and nuclear physics - for supervised machine learning studies. The datasets contain hadronic top quarks, cosmic-ray induced air showers, phase transitions in hadronic matter, and generator-level histories. While public datasets from multiple fundamental physics disciplines already exist, the common interface and provided reference models simplify future work on cross-disciplinary machine learning and transfer learning in fundamental physics. We discuss the design and structure and line out how additional datasets can be submitted for inclusion. As showcase application, we present a simple yet flexible graph-based neural network architecture that can easily be applied to a wide range of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
