Ultra Fast Transformers on FPGAs for Particle Physics Experiments

Zhixing Jiang; Dennis Yin; Elham E Khoda; Vladimir Loncar; Ekaterina; Govorkova; Eric Moreno; Philip Harris; Scott Hauck; Shih-Chieh Hsu

arXiv:2402.01047·cs.LG·February 5, 2024·1 cites

Ultra Fast Transformers on FPGAs for Particle Physics Experiments

Zhixing Jiang, Dennis Yin, Elham E Khoda, Vladimir Loncar, Ekaterina, Govorkova, Eric Moreno, Philip Harris, Scott Hauck, Shih-Chieh Hsu

PDF

Open Access

TL;DR

This paper presents an FPGA implementation of transformer components optimized for particle physics experiments, achieving ultra-low latency suitable for real-time trigger systems at CERN.

Contribution

It introduces an efficient FPGA-based transformer implementation using hls4ml, tailored for particle physics applications, with critical components like multi-head attention and softmax.

Findings

01

Latency under 2 microseconds on Xilinx UltraScale+ FPGA

02

Effective application to jet flavor tagging problem

03

Compatible with CERN LHC trigger requirements

Abstract

This work introduces a highly efficient implementation of the transformer architecture on a Field-Programmable Gate Array (FPGA) by using the \texttt{hls4ml} tool. Given the demonstrated effectiveness of transformer models in addressing a wide range of problems, their application in experimental triggers within particle physics becomes a subject of significant interest. In this work, we have implemented critical components of a transformer model, such as multi-head attention and softmax layers. To evaluate the effectiveness of our implementation, we have focused on a particle physics jet flavor tagging problem, employing a public dataset. We recorded latency under 2 $μ$ s on the Xilinx UltraScale+ FPGA, which is compatible with hardware trigger requirements at the CERN Large Hadron Collider experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParticle Detector Development and Performance · Particle accelerators and beam dynamics · Advanced Data Storage Technologies

MethodsAttention Is All You Need · Linear Layer · Softmax · Multi-Head Attention