Collecting and Presenting Reproducible Intranode Stencil Performance:   INSPECT

Julian Hornich; Julian Hammer; Georg Hager; Thomas Gruber; Gerhard; Wellein

arXiv:1906.08138·cs.PF·June 25, 2020

Collecting and Presenting Reproducible Intranode Stencil Performance: INSPECT

Julian Hornich, Julian Hammer, Georg Hager, Thomas Gruber, Gerhard, Wellein

PDF

TL;DR

This paper introduces INSPECT, an open-source framework for reproducible performance measurement and modeling of stencil algorithms across various hardware architectures, aiding developers in performance assessment and optimization.

Contribution

It presents a generalizable methodology, tools, and a collection of results for reproducible intranode stencil performance evaluation across multiple architectures.

Findings

01

Reproducible performance data for multiple stencil patterns

02

Validated performance models across different hardware

03

Open-source toolchain for performance analysis

Abstract

Stencil algorithms have been receiving considerable interest in HPC research for decades. The techniques used to approach multi-core stencil performance modeling and engineering span basic runtime measurements, elaborate performance models, detailed hardware counter analysis, and thorough scaling behavior evaluation. Due to the plurality of approaches and stencil patterns, we set out to develop a generalizable methodology for reproducible measurements accompanied by state-of-the-art performance models. Our open-source toolchain, and collected results are publicly available in the "Intranode Stencil Performance Evaluation Collection" (INSPECT). We present the underlying methodologies, models and tools involved in gathering and documenting the performance behavior of a collection of typical stencil patterns across multiple architectures and hardware configuration options. Our aim is to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.