CMS Data Analysis: Current Status and Future Strategy
Vincenzo Innocente

TL;DR
This paper reviews the CMS data analysis architecture and its two main frameworks, COBRA and IGUANA, and describes prototypes for future Grid-based distributed analysis.
Contribution
It describes the overall design and current use of the CMS analysis frameworks, with emphasis on interactive analysis, and reports on the development of Grid-enabled remote analysis prototypes.
Findings
CMS analysis frameworks COBRA and IGUANA are actively used worldwide.
Remote analysis prototypes are under development, including one based on Clarens, a Grid-enabled client-server tool.
Use of the prototypes by CMS physicists will guide the formation of a Grid-enriched analysis strategy.
Abstract
We present the current status of CMS data analysis architecture and describe work on future Grid-based distributed analysis prototypes. CMS has two main software frameworks related to data analysis: COBRA, the main framework, and IGUANA, the interactive visualisation framework. Software using these frameworks is used today in the world-wide production and analysis of CMS data. We describe their overall design and present examples of their current use with emphasis on interactive analysis. CMS is currently developing remote analysis prototypes, including one based on Clarens, a Grid-enabled client-server tool. Use of the prototypes by CMS physicists will guide us in forming a Grid-enriched analysis strategy. The status of this work is presented, as is an outline of how we plan to leverage the power of our existing frameworks in the migration of CMS software to the Grid.
Taxonomy
Topics: Distributed and Parallel Computing Systems
