Towards foundation-style models for energy-frontier heterogeneous neutrino detectors via self-supervised pre-training

Sa\'ul Alonso-Monsalve; Fabio Cufino; Umut Kose; Anna Mascellani; Andr\'e Rubbia

arXiv:2604.07037·hep-ex·April 9, 2026

Towards foundation-style models for energy-frontier heterogeneous neutrino detectors via self-supervised pre-training

Sa\'ul Alonso-Monsalve, Fabio Cufino, Umut Kose, Anna Mascellani, Andr\'e Rubbia

PDF

TL;DR

This paper introduces a self-supervised learning framework using sparse ViT models to improve interpretation of complex, heterogeneous neutrino detector data at the energy frontier, enhancing performance and data efficiency.

Contribution

It presents a novel self-supervised pre-training approach for heterogeneous detector data, enabling better representation learning and transferability in neutrino physics analysis.

Findings

01

Pre-training improves neutrino flavor and charm-quark identification.

02

Relational objectives enhance performance in complex topologies.

03

Pre-trained models match or surpass baselines on various benchmarks.

Abstract

Accelerator-based neutrino physics is entering an energy-frontier regime in which interactions reach the TeV scale and produce exceptionally dense, overlapping detector signatures. In this regime, event interpretation becomes impractical for conventional reconstruction approaches, particularly when labelled data are scarce and the analysis spans diverse downstream objectives. We present a sparse ViT framework for learning reusable representations from heterogeneous detector data. Self-supervised pre-training combines masked autoencoder reconstruction with relational voxel-level objectives for hierarchy, ghost and particle identification, and the resulting shared encoder is then jointly fine-tuned across classification and regression tasks. Evaluated on simulated events from the proposed FASERCal concept at the LHC, we find that pre-training consistently improves neutrino flavour and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.