Dwarfs on Accelerators: Enhancing OpenCL Benchmarking for Heterogeneous Computing Architectures
Beau Johnston, Josh Milthorpe

TL;DR
This paper extends the OpenDwarfs OpenCL benchmark suite to better evaluate heterogeneous computing architectures, emphasizing robustness, correctness, and diverse hardware performance analysis.
Contribution
It introduces an enhanced OpenCL benchmark suite with improved robustness and correctness focus, enabling comprehensive performance evaluation across heterogeneous architectures.
Findings
Performance varies significantly across architectures
Benchmark suite effectively captures hardware differences
Enhanced suite improves reliability of results
Abstract
For reasons of both performance and energy efficiency, high-performance computing (HPC) hardware is becoming increasingly heterogeneous. The OpenCL framework supports portable programming across a wide range of computing devices and is gaining influence in programming next-generation accelerators. To characterize the performance of these devices across a range of applications requires a diverse, portable and configurable benchmark suite, and OpenCL is an attractive programming model for this purpose. We present an extended and enhanced version of the OpenDwarfs OpenCL benchmark suite, with a strong focus placed on the robustness of applications, curation of additional benchmarks with an increased emphasis on correctness of results and choice of problem size. Preliminary results and analysis are reported for eight benchmark codes on a diverse set of architectures -- three Intel CPUs,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Advanced Data Storage Technologies
