Towards foundation-style models for energy-frontier heterogeneous neutrino detectors via self-supervised pre-training
Sa\'ul Alonso-Monsalve, Fabio Cufino, Umut Kose, Anna Mascellani, Andr\'e Rubbia

TL;DR
This paper introduces a self-supervised learning framework using sparse ViT models to improve interpretation of complex, heterogeneous neutrino detector data at the energy frontier, enhancing performance and data efficiency.
Contribution
It presents a novel self-supervised pre-training approach for heterogeneous detector data, enabling better representation learning and transferability in neutrino physics analysis.
Findings
Pre-training improves neutrino flavor and charm-quark identification.
Relational objectives enhance performance in complex topologies.
Pre-trained models match or surpass baselines on various benchmarks.
Abstract
Accelerator-based neutrino physics is entering an energy-frontier regime in which interactions reach the TeV scale and produce exceptionally dense, overlapping detector signatures. In this regime, event interpretation becomes impractical for conventional reconstruction approaches, particularly when labelled data are scarce and the analysis spans diverse downstream objectives. We present a sparse ViT framework for learning reusable representations from heterogeneous detector data. Self-supervised pre-training combines masked autoencoder reconstruction with relational voxel-level objectives for hierarchy, ghost and particle identification, and the resulting shared encoder is then jointly fine-tuned across classification and regression tasks. Evaluated on simulated events from the proposed FASERCal concept at the LHC, we find that pre-training consistently improves neutrino flavour and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
