CMSSW Scaling Limits on Many-Core Machines
Christopher Jones (1), Patrick Gartung (1) ((1) Fermilab)

TL;DR
This paper evaluates the scalability of the CMS software framework (CMSSW) on modern many-core CPUs, analyzing its performance and limitations in high-concurrency environments for particle physics data processing.
Contribution
It provides a detailed review of CMSSW's concurrency model and measures its scalability on many-core architectures using real CMS applications.
Findings
CMSSW's threading efficiency has improved over time.
I/O is identified as the major scaling limitation.
Performance metrics include event throughput and memory usage.
Abstract
Today the LHC offline computing relies heavily on CPU resources, despite the interest in compute accelerators, such as GPUs, for the longer term future. The number of cores per CPU socket has continued to increase steadily, reaching the levels of 64 cores (128 threads) with recent AMD EPYC processors, and 128 cores on Ampere Altra Max ARM processors. Over the course of the past decade, the CMS data processing framework, CMSSW, has been transformed from a single-threaded framework into a highly concurrent one. The first multithreaded version was brought into production by the start of the LHC Run 2 in 2015. Since then, the framework's threading efficiency has gradually been improved by adding more levels of concurrency and reducing the amount of serial code paths. The latest addition was support for concurrent Runs. In this work we review the concurrency model of the CMSSW, and measure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Distributed and Parallel Computing Systems · Parallel Computing and Optimization Techniques
