Distributed Offline Data Reconstruction in BaBar
Teela Pulliam, Peter Elmer, Alvise Dorigo

TL;DR
This paper describes the development and deployment of a distributed, two-pass data reconstruction system for the BaBar experiment, enabling efficient processing and off-site distribution of large-scale data with high performance.
Contribution
It introduces a novel two-pass, distributed processing architecture for large-scale data reconstruction, including off-site processing and system performance details.
Findings
Successfully processed 90 TB dataset from SLAC
Achieved processing of 2-4 million events per day on distributed farms
Demonstrated effective off-site data processing and quality control
Abstract
The BaBar experiment at SLAC is in its fourth year of running. The data processing system has been continuously evolving to meet the challenges of higher luminosity running and the increasing bulk of data to re-process each year. To meet these goals a two-pass processing architecture has been adopted, where 'rolling calibrations' are quickly calculated on a small fraction of the events in the first pass and the bulk data reconstruction done in the second. This allows for quick detector feedback in the first pass and allows for the parallelization of the second pass over two or more separate farms. This two-pass system allows also for distribution of processing farms off-site. The first such site has been setup at INFN Padova. The challenges met here were many. The software was ported to a full Linux-based, commodity hardware system. The raw dataset, 90 TB, was imported from SLAC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNuclear Physics and Applications · Medical Imaging Techniques and Applications · Radiation Detection and Scintillator Technologies
