JetFormer: A Scalable and Efficient Transformer for Jet Tagging from Offline Analysis to FPGA Triggers
Ruoqing Zheng, Chang Sun, Qibin Liu, Lauri Laatu, Arianna Cox, Benedikt Maier, Alexander Tapper, Jose G. F. Coutinho, Wayne Luk, Zhiqiang Que

TL;DR
JetFormer is a scalable Transformer architecture for particle jet tagging at the LHC, achieving high accuracy with reduced computational cost and adaptable to both offline analysis and FPGA-based online triggers.
Contribution
Introduces JetFormer, a versatile Transformer model that performs well across various jet tagging scenarios and is optimized for hardware deployment.
Findings
Matches ParT model accuracy with 37.4% fewer FLOPs
Outperforms MLPs, Deep Sets, and Interaction Networks by 3-4% in accuracy
Enables FPGA deployment with minimal accuracy loss through pruning and quantization
Abstract
We present JetFormer, a versatile and scalable encoder-only Transformer architecture for particle jet tagging at the Large Hadron Collider (LHC). Unlike prior approaches that are often tailored to specific deployment regimes, JetFormer is designed to operate effectively across the full spectrum of jet tagging scenarios, from high-accuracy offline analysis to ultra-low-latency online triggering. The model processes variable-length sets of particle features without relying on input of explicit pairwise interactions, yet achieves competitive or superior performance compared to state-of-the-art methods. On the large-scale JetClass dataset, a large-scale JetFormer matches the accuracy of the interaction-rich ParT model (within 0.7%) while using 37.4% fewer FLOPs, demonstrating its computational efficiency and strong generalization. On benchmark HLS4ML 150P datasets, JetFormer consistently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParticle Detector Development and Performance · Parallel Computing and Optimization Techniques · Particle physics theoretical and experimental studies
