DIRAC - Distributed Infrastructure with Remote Agent Control
N.Brook, A.Bogdanchikov, A.Buckley, J.Closier, U.Egede, M.Frank,, D.Galli, M.Gandelman, V.Garonne, C.Gaspar, R.Graciani Diaz, K.Harrison, E.van, Herwijnen, A.Khan, S.Klous, I.Korolko, G.Kuznetsov, F.Loverre, U.Marconi,, J.P.Palacios, G.N.Patrick, A.Pickford, S.Ponce

TL;DR
DIRAC is a distributed system for LHCb Monte Carlo production that uses a client/server architecture with software agents for job management, successfully deployed across multiple institutes and adaptable to various grid environments.
Contribution
The paper introduces DIRAC, a flexible, agent-based distributed infrastructure for high-energy physics data processing, capable of operating across diverse grid middleware environments.
Findings
Successfully installed at 18 institutes
Used in recent Physics Data Challenges
Supports mixed grid environments
Abstract
This paper describes DIRAC, the LHCb Monte Carlo production system. DIRAC has a client/server architecture based on: Compute elements distributed among the collaborating institutes; Databases for production management, bookkeeping (the metadata catalogue) and software configuration; Monitoring and cataloguing services for updating and accessing the databases. Locally installed software agents implemented in Python monitor the local batch queue, interrogate the production database for any outstanding production requests using the XML-RPC protocol and initiate the job submission. The agent checks and, if necessary, installs any required software automatically. After the job has processed the events, the agent transfers the output data and updates the metadata catalogue. DIRAC has been successfully installed at 18 collaborating institutes, including the DataGRID, and has been used in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Advanced Data Storage Technologies · Scientific Computing and Data Management
