BOSS: a tool for batch job monitoring and book-keeping
C.Grandi

TL;DR
BOSS is a tool designed for real-time monitoring and bookkeeping of batch jobs in compute farms, storing detailed job info in a relational database for efficient access and management.
Contribution
It introduces a system that captures, filters, and stores job data in a structured database, facilitating job management in grid environments.
Findings
Successfully used by CMS Regional Centers for Monte Carlo data production.
Demonstrated effective operation within the European DataGrid test bed.
Enabled real-time job monitoring and data bookkeeping.
Abstract
BOSS (Batch Object Submission System) has been developed to provide real-time monitoring and bookkeeping of jobs submitted to a compute farm system. The information is persistently stored in a relational database (MySQL in the current version) for further processing. By means of user-supplied filters, BOSS extracts the specific job information to be monitored from the standard input, output and error of the job itself and stores it in the database in a structured form that allows easy and efficient access. BOSS has been successfully used by all CMS Regional Centers for managing Monte Carlo data productions in 2002. Furthermore in fall 2002 it has been used in a prototype of the CMS production system deployed on the European DataGrid test bed demonstrating its ability to be used also in a grid environment.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Cloud Computing and Resource Management · Software System Performance and Reliability
