Continuous benchmarking: Keeping pace with an evolving ecosystem of models and technologies

Jan Vogelsang; Melissa Lober; Catherine Mia Sch\"ofmann; Jos\'e Villamar; Dennis Terhorst; Johanna Senk; Hans Ekkehard Plesser; Markus Diesmann; Susanne Kunkel; Anno C. Kurth

arXiv:2604.15919·cs.DC·April 22, 2026

Continuous benchmarking: Keeping pace with an evolving ecosystem of models and technologies

Jan Vogelsang, Melissa Lober, Catherine Mia Sch\"ofmann, Jos\'e Villamar, Dennis Terhorst, Johanna Senk, Hans Ekkehard Plesser, Markus Diesmann, Susanne Kunkel, Anno C. Kurth

PDF

TL;DR

This paper introduces an automated, continuous benchmarking pipeline designed to adapt to rapidly evolving models and technologies, emphasizing reproducibility, collaboration, and sustainability in high-performance computing for neuroscience and AI.

Contribution

It extends previous benchmarking workflows with user-agnostic and continuous benchmarking features, facilitating sustainable progress in research software development.

Findings

01

Enhanced reproducibility and re-use of benchmarking results

02

Software solutions for rapid adaptation to evolving models and systems

03

Support for collaborative and customizable benchmarking workflows

Abstract

Drawing on ideas from continuous integration, we present concepts of an automated benchmarking pipeline for high performance applications. Customization and collaboration have been key design goals owing to the requirements of research-software development as a continuous community effort. We have extended our previous conceptual work on systematic benchmarking workflows with the functionality of user-agnostic operations as well as continuous benchmarking. This fosters reproducibility and re-use of benchmarking results to ensure sustainable technological progress. We provide software-engineering solutions to keep pace with the rapid evolution of both large-scale models and high-performance computing systems with a view towards the scientific domains of neuroscience and artificial intelligence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.