SimdBench: Benchmarking Large Language Models for SIMD-Intrinsic Code Generation
Yibo He, Shuoran Zhao, Jiaming Huang, Yingjie Fu, Hao Yu, Cunjian Huang, Tao Xie

TL;DR
SimdBench is a new benchmark designed to evaluate how well large language models generate SIMD-intrinsic code, revealing current limitations and guiding future improvements in this specialized domain.
Contribution
This paper introduces SimdBench, the first dedicated benchmark for SIMD-intrinsic code generation by LLMs, and provides a systematic evaluation of 18 models across five SIMD extensions.
Findings
LLMs show decreased pass@k in SIMD-intrinsic code generation
Performance varies significantly across different SIMD extensions
Insights suggest directions for improving LLMs in vectorized code generation
Abstract
SIMD (Single Instruction Multiple Data) instructions and their compiler intrinsics are widely supported by modern processors to accelerate performance-critical tasks. SIMD intrinsic programming, a trade-off between coding productivity and high performance, is widely used in the development of mainstream performance-critical libraries and daily computing tasks. Large Language Models (LLMs), which have demonstrated strong and comprehensive capabilities in code generation, show promise in assisting programmers with the challenges of SIMD intrinsic programming. However, existing code-generation benchmarks focus on only scalar code, and it is unclear how LLMs perform in generating vectorized code using SIMD intrinsics. To fill this gap, we propose SimdBench, the first code benchmark specifically designed for SIMD-intrinsic code generation, comprising 136 carefully crafted tasks and targeting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
