GPU-accelerated machine learning inference as a service for computing in   neutrino experiments

Michael Wang; Tingjun Yang; Maria Acosta Flechas; Philip Harris,; Benjamin Hawks; Burt Holzman; Kyle Knoepfel; Jeffrey Krupa; Kevin Pedro; Nhan; Tran

arXiv:2009.04509·physics.comp-ph·March 25, 2021

GPU-accelerated machine learning inference as a service for computing in neutrino experiments

Michael Wang, Tingjun Yang, Maria Acosta Flechas, Philip Harris,, Benjamin Hawks, Burt Holzman, Kyle Knoepfel, Jeffrey Krupa, Kevin Pedro, Nhan, Tran

PDF

TL;DR

This paper presents SONIC, a GPU-accelerated web service for neutrino experiment data processing that significantly speeds up event reconstruction tasks, reducing total processing time and costs.

Contribution

We introduce a GPU-based inference service integrated into neutrino data processing, achieving substantial speedups and cost efficiency without disrupting existing workflows.

Findings

01

Track and shower hit identification accelerated by 17x

02

Total processing time reduced by a factor of 2.7

03

Only 1 GPU per 68 CPU threads needed for efficiency

Abstract

Machine learning algorithms are becoming increasingly prevalent and performant in the reconstruction of events in accelerator-based neutrino experiments. These sophisticated algorithms can be computationally expensive. At the same time, the data volumes of such experiments are rapidly increasing. The demand to process billions of neutrino events with many machine learning algorithm inferences creates a computing challenge. We explore a computing model in which heterogeneous computing with GPU coprocessors is made available as a web service. The coprocessors can be efficiently and elastically deployed to provide the right amount of computing for a given processing task. With our approach, Services for Optimized Network Inference on Coprocessors (SONIC), we integrate GPU acceleration specifically for the ProtoDUNE-SP reconstruction chain without disrupting the native computing workflow.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.