SAKURAONE: An Open Ethernet-Based AI HPC System and Its Observed Workload Dynamics in a Single-Tenant LLM Development Environment
Fumikazu Konishi, Yuuki Tsubouchi, Hirofumi Tsuruta

TL;DR
SAKURAONE is an open Ethernet-based HPC system optimized for LLM workloads, demonstrating high scalability, performance, and real-world workload dynamics in a single-tenant environment.
Contribution
It introduces SAKURAONE, a fully open networking HPC cluster with detailed performance metrics and workload analysis in a single-tenant LLM development setting.
Findings
Achieved 33.95 PFLOP/s on HPL benchmark.
Observed workload shift from large-scale to mid-scale jobs over time.
Demonstrated scalability of vendor-neutral 800 GbE networking stack.
Abstract
SAKURAONE is a managed high performance computing (HPC) cluster developed and operated by the SAKURA Internet Research Center. It builds on the KOKARYOKU PHY bare metal GPU platform and is optimized for advanced workloads, including large language model (LLM) training. In ISC 2025 TOP500, SAKURAONE is ranked 49th by HPL and is the only top 100 system that uses a fully open networking stack - 800 GbE with SONiC - demonstrating the scalability of vendor-neutral technology. Measured performance is 33.95 PFLOP/s (HPL Rmax), 396.295 TFLOP/s (HPCG), and 339.86 PFLOP/s on HPL-MxP with FP8. The system consists of 100 nodes, each with eight NVIDIA H100 GPUs and a 2 PB all-flash Lustre file system, interconnected via a rail-optimized 800 GbE leaf-spine fabric with RoCEv2. Through exclusive use by a single research project, we observed the characteristics of development-related jobs. Consistent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
