Optimizing the performance of Lattice Gauge Theory simulations with Streaming SIMD extensions
Shyam Srinivasan

TL;DR
This paper demonstrates how Streaming SIMD Extensions (SSE) enhance the performance of lattice gauge theory simulations, analyzing factors affecting speed-ups and proposing a performance model to guide future optimizations.
Contribution
It provides an empirical analysis of SSE's impact on simulation speed and introduces a performance model to evaluate bottlenecks and guide hardware/software improvements.
Findings
SSE yields higher speed-ups with single precision than double precision.
Performance improvements are limited by architectural constraints and data retrieval patterns.
A performance model helps identify bottlenecks and predict future hardware benefits.
Abstract
Two factors, which affect simulation quality are the amount of computing power and implementation. The Streaming SIMD (single instruction multiple data) extensions (SSE) present a technique for influencing both by exploiting the processor's parallel functionalism. In this paper, we show how SSE improves performance of lattice gauge theory simulations. We identified two significant trends through an analysis of data from various runs. The speed-ups were higher for single precision than double precision floating point numbers. Notably, though the use of SSE significantly improved simulation time, it did not deliver the theoretical maximum. There are a number of reasons for this: architectural constraints imposed by the FSB speed, the spatial and temporal patterns of data retrieval, ratio of computational to non-computational instructions, and the need to interleave miscellaneous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematics, Computing, and Information Processing · Algorithms and Data Compression · Advanced Data Storage Technologies
