SubMIT: A Physics Analysis Facility at MIT
Josh Bendavid (1, 2), Mariarosaria D'Alfonso (1), Jan Eysermans (1), Chad Freer (1), Maxim Goncharov (1), Matthew Heine (1), Luca Lavezzo (1), Marianne Moore (1), Christoph Paus (1), Xuejian Shen (1), David Walter (1)

TL;DR
SubMIT is a high-performance data analysis platform at MIT that combines fast data access, large-scale processing, and flexible computing resources, designed to support future high-energy physics experiments.
Contribution
The paper introduces SubMIT, a novel analysis facility integrating high-speed data access, large-scale processing, and flexible user environments for advanced physics data analysis.
Findings
Achieves 100Gbps network speeds per server
Supports diverse hardware including CPUs, GPUs, and containers
Demonstrates effective benchmarking with real-life analyses
Abstract
The recently completed SubMIT platform is a small set of servers that provide interactive access to substantial data samples at high speeds, enabling sophisticated data analyses with very fast turnaround times. Additionally, it seamlessly integrates massive processing resources for large-scale tasks by connecting to a set of powerful batch processing systems. It serves as an ideal prototype for an Analysis Facility tailored to meet the demanding data and computational requirements anticipated during the High-Luminosity phase of the Large Hadron Collider. The key features that make this facility so powerful include highly optimized data access with a minimum of 100Gbps networking per server, a large managed NVMe storage system, and a substantial spinning-disk Ceph file system. The platform integrates a diverse set of high multicore CPU machines for tasks benefiting from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParticle physics theoretical and experimental studies · Scientific Computing and Data Management · Distributed and Parallel Computing Systems
