XRISM Pre-Pipeline and Singularity: Container-Based Data Processing for the X-Ray Imaging and Spectroscopy Mission and High-Performance Computing
Satoshi Eguchi, Makoto Tashiro, Yukikatsu Terada, Hiromitsu Takahashi, Masayoshi Nobukawa, Ken Ebisawa, Katsuhiro Hayashi, Tessei Yoshida, Yoshiaki Kanemaru, Shoji Ogawa, Matthew P. Holland, Michael Loewenstein, Eric D. Miller, Tahir Yaqoob, Robert S. Hill, Morgan D. Waddy

TL;DR
This paper presents a container-based data processing system for the XRISM X-ray observatory, achieving significant speedups using Singularity containers on high-performance computing infrastructure.
Contribution
It introduces a containerized pre-pipeline system utilizing Singularity for XRISM data processing, enhancing performance on HPC systems.
Findings
33x speedup in data processing tasks
Successful porting of XRISM pre-pipeline to HPC environment
Effective use of Singularity containers on high-performance systems
Abstract
The X-Ray Imaging and Spectroscopy Mission (XRISM) is the seventh Japanese X-ray observatory whose development and operation are in collaboration with universities and research institutes in Japan, the United States, and Europe, including JAXA, NASA, and ESA. The telemetry data downlinked from the satellite are reduced to scientific products using pre-pipeline (PPL) and pipeline (PL) software running on standard Linux virtual machines (VMs) for the JAXA and NASA sides, respectively. OBSIDs identified the observations, and we had 80 and 161 OBSIDs to be reprocessed at the end of the commissioning period and performance verification and calibration period, respectively. The combination of the containerized PPL utilizing Singularity of a container platform running on the JAXA's "TOKI-RURI" high-performance computing (HPC) system and working disk images formatted to ext3 accomplished a 33x…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Scientific Computing and Data Management · Particle Detector Development and Performance
