Reduced Precision Strategies for Deep Learning: A High Energy Physics Generative Adversarial Network Use Case
Florian Rehm, Sofia Vallecorsa, Vikram Saletore, Hans Pabst, Adel, Chaibi, Valeriu Codreanu, Kerstin Borras, Dirk Kr\"ucker

TL;DR
This paper explores low precision quantization of deep generative adversarial networks for high energy physics simulations, demonstrating significant speed-up and maintained accuracy using novel and existing quantization tools.
Contribution
It introduces the use of the Intel iLoT tool for quantizing complex GANs in physics, showing improved performance and accuracy preservation over TensorFlow Lite.
Findings
1.73x speed-up on Intel hardware with iLoT quantization
Lower physical accuracy loss with iLoT compared to TensorFlow Lite
Effective quantization of complex physics-based GAN models
Abstract
Deep learning is finding its way into high energy physics by replacing traditional Monte Carlo simulations. However, deep learning still requires an excessive amount of computational resources. A promising approach to make deep learning more efficient is to quantize the parameters of the neural networks to reduced precision. Reduced precision computing is extensively used in modern deep learning and results to lower execution inference time, smaller memory footprint and less memory bandwidth. In this paper we analyse the effects of low precision inference on a complex deep generative adversarial network model. The use case which we are addressing is calorimeter detector simulations of subatomic particle interactions in accelerator based high energy physics. We employ the novel Intel low precision optimization tool (iLoT) for quantization and compare the results to the quantized model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
