FirecREST v2: lessons learned from redesigning an API for scalable HPC resource access
Elia Palme (1), Juan Pablo Dorsch (1), Ali Khosravi (2), Giovanni Pizzi (2), Francesco Pagnamenta (1), Andrea Ceriani (1), Eirini Koutsaniti (1), Rafael Sarmiento (1), Ivano Bonesana (1), Alejandro Dabin (1) ((1) CSCS - Swiss National Supercomputing Centre. Lugano

TL;DR
FirecREST v2 significantly improves HPC resource access API performance by 100x through a comprehensive redesign focused on security and throughput, with detailed testing and validation.
Contribution
This paper presents a complete redesign of FirecREST API, achieving major performance gains and integrating security and high throughput as core features.
Findings
100x performance improvement over previous version
Identification of common bottlenecks in proxy-based APIs
Validated performance gains through independent peer review
Abstract
Introducing FirecREST v2, the next generation of our open-source RESTful API for programmatic access to HPC resources. FirecREST v2 delivers a 100x performance improvement over its predecessor. This paper explores the lessons learned from redesigning FirecREST from the ground up, with a focus on integrating enhanced security and high throughput as core requirements. We provide a detailed account of our systematic performance testing methodology, highlighting common bottlenecks in proxy-based APIs with intensive I/O operations. Key design and architectural changes that enabled these performance gains are presented. Finally, we demonstrate the impact of these improvements, supported by independent peer validation, and discuss opportunities for further improvements.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Security and Verification in Computing · Distributed and Parallel Computing Systems
