TL;DR
This paper introduces MIParT, a novel particle transformer with a new attention mechanism that achieves high accuracy in jet tagging while reducing model complexity and enhancing performance with large datasets.
Contribution
The paper presents MIParT, a new deep learning model with a unique attention mechanism that improves jet tagging accuracy and efficiency over existing methods, especially on large datasets.
Findings
MIParT matches LorentzNet accuracy and outperforms ParT in background rejection.
MIParT reduces model parameters and computational complexity.
Large dataset training with MIParT-L further improves performance.
Abstract
In this study, we introduce the More-Interaction Particle Transformer (MIParT), a novel deep learning neural network designed for jet tagging. This framework incorporates our own design, the More-Interaction Attention (MIA) mechanism, which increases the dimensionality of particle interaction embeddings. We tested MIParT using the top tagging and quark-gluon datasets. Our results show that MIParT not only matches the accuracy and AUC of LorentzNet and a series of Lorentz-equivariant methods, but also significantly outperforms the ParT model in background rejection. Specifically, it improves background rejection by approximately 25% at a 30% signal efficiency on the top tagging dataset and by 3% on the quark-gluon dataset. Additionally, MIParT requires only 30% of the parameters and 53% of the computational complexity needed by ParT, proving that high performance can be achieved with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
