Extending the Fermi-LAT Data Processing Pipeline to the Grid
Stephan Zimmer, Luisa Arrabito, Tom Glanzman, Tony Johnson, Claudia, Lavalley, Andrei Tsaregorodtsev

TL;DR
This paper describes the extension of the Fermi-LAT data processing pipeline to incorporate grid computing resources, enhancing automation, monitoring, and resource utilization for large-scale data analysis.
Contribution
It introduces a new interface to Grid systems, enabling efficient use of distributed computing resources for Fermi-LAT data processing.
Findings
Successful integration with Grid resources via Dirac system
Enhanced monitoring and workflow management capabilities
Improved scalability and resource utilization
Abstract
The Data Handling Pipeline ("Pipeline") has been developed for the Fermi Gamma-Ray Space Telescope (Fermi) Large Area Telescope (LAT) which launched in June 2008. Since then it has been in use to completely automate the production of data quality monitoring quantities, reconstruction and routine analysis of all data received from the satellite and to deliver science products to the collaboration and the Fermi Science Support Center. Aside from the reconstruction of raw data from the satellite (Level 1), data reprocessing and various event-level analyses are also reasonably heavy loads on the pipeline and computing resources. These other loads, unlike Level 1, can run continuously for weeks or months at a time. In addition it receives heavy use in performing production Monte Carlo tasks. The software comprises web-services that allow online monitoring and provides charts summarizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
