A Language Model for Particle Tracking

Andris Huang; Yash Melkani; Paolo Calafiura; Alina Lazar; Daniel; Thomas Murnane; Minh-Tuan Pham; Xiangyang Ju

arXiv:2402.10239·hep-ph·February 19, 2024·1 cites

A Language Model for Particle Tracking

Andris Huang, Yash Melkani, Paolo Calafiura, Alina Lazar, Daniel, Thomas Murnane, Minh-Tuan Pham, Xiangyang Ju

PDF

Open Access 1 Models

TL;DR

This paper introduces TrackingBERT, a novel language model for particle tracking at the LHC, enabling better generalization and multi-task capabilities through a tokenized detector representation.

Contribution

The paper presents a new tokenized detector representation and trains a BERT-based model, TrackingBERT, for particle tracking, pioneering a foundational model approach in this domain.

Findings

01

TrackingBERT achieves effective particle tracking performance.

02

The model provides latent detector embeddings for auxiliary tasks.

03

First application of language models to particle detector data.

Abstract

Particle tracking is crucial for almost all physics analysis programs at the Large Hadron Collider. Deep learning models are pervasively used in particle tracking related tasks. However, the current practice is to design and train one deep learning model for one task with supervised learning techniques. The trained models work well for tasks they are trained on but show no or little generalization capabilities. We propose to unify these models with a language model. In this paper, we present a tokenized detector representation that allows us to train a BERT model for particle tracking. The trained BERT model, namely TrackingBERT, offers latent detector module embedding that can be used for other tasks. This work represents the first step towards developing a foundational model for particle detector understanding.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
HWresearch/GNN4Colliders
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · WordPiece · Linear Warmup With Linear Decay · Softmax · Multi-Head Attention · Layer Normalization · Dropout · Residual Connection