Improved performance of QCD code on ALiCE
Z. Sroczynski

TL;DR
This paper reports on optimizing QCD code performance on the ALiCE cluster through techniques like assembler kernel metaprogramming, data layout improvements, and communication overhead analysis.
Contribution
It describes the optimisation techniques applied to QCD code on ALiCE, an Alpha-Linux cluster: metaprogramming of assembler kernels, changes to data layout, and an investigation of communication overheads.
Findings
Optimized QCD code performance achieved on ALiCE cluster
Metaprogramming of assembler kernels improved computational efficiency
Analysis of communication overheads informed optimization strategies
Abstract
We present results for the performance of QCD code on ALiCE, the Alpha-Linux Cluster Engine at Wuppertal. We describe the techniques employed to optimise the code, including the metaprogramming of assembler kernels, the effects of data layout and an investigation into the overheads incurred by the communication.
