Quark/Gluon Discrimination and Top Tagging with Dual Attention Transformer
Minxuan He, Daohan Wang

TL;DR
This paper introduces the Particle Dual Attention Transformer (P-DAT), a novel deep learning architecture that enhances jet tagging by capturing both global and local particle interactions efficiently.
Contribution
The study presents a new transformer-based architecture with dual attention mechanisms for improved jet tagging and particle interaction modeling.
Findings
Achieves competitive performance in top tagging tasks.
Effectively captures both particle-level and jet-level interactions.
Demonstrates efficiency in computational resources.
Abstract
Jet tagging is a crucial classification task in high energy physics. Recently the performance of jet tagging has been significantly improved by the application of deep learning techniques. In this study, we introduce a new architecture for jet tagging: the Particle Dual Attention Transformer (P-DAT). This novel transformer architecture stands out by concurrently capturing both global and local information, while maintaining computational efficiency. Regarding the self attention mechanism, we have extended the established attention mechanism between particles to encompass the attention mechanism between particle features. The particle attention module computes particle level interactions across all the particles, while the channel attention module computes attention scores between particle features, which naturally captures jet level interactions by taking all particles into account.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParticle physics theoretical and experimental studies · Particle Detector Development and Performance · High-Energy Particle Collisions Research
