Prototyping a ROOT-based distributed analysis workflow for HL-LHC: the CMS use case
Tommaso Tedeschi (1, 2), Vincenzo Eduardo Padulano (3), Daniele, Spiga (1), Diego Ciangottini (1), Mirco Tracolli (1), Enric Tejedor Saavedra, (3), Enrico Guiraud (3, 5), Massimo Biasotto (4) ((1) INFN Sezione di, Perugia, Via A. Pascoli, 06123 Perugia, Italy

TL;DR
This paper demonstrates a ROOT-based distributed analysis workflow for CMS at HL-LHC, utilizing modern declarative programming and interactive interfaces to improve efficiency and scalability of physics data analysis.
Contribution
It introduces a novel ROOT RDataFrame-based distributed analysis workflow with a web interface, enhancing user experience and performance for CMS physics analyses.
Findings
Order of magnitude speedup over legacy methods
Successful deployment of a web-based interactive analysis platform
Efficient utilization of distributed grid resources
Abstract
The challenges expected for the next era of the Large Hadron Collider (LHC), both in terms of storage and computing resources, provide LHC experiments with a strong motivation for evaluating ways of rethinking their computing models at many levels. Great efforts have been put into optimizing the computing resource utilization for the data analysis, which leads both to lower hardware requirements and faster turnaround for physics analyses. In this scenario, the Compact Muon Solenoid (CMS) collaboration is involved in several activities aimed at benchmarking different solutions for running High Energy Physics (HEP) analysis workflows. A promising solution is evolving software towards more user-friendly approaches featuring a declarative programming model and interactive workflows. The computing infrastructure should keep up with this trend by offering on the one side modern interfaces,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Scientific Computing and Data Management · Particle Detector Development and Performance
