SimdBench: Benchmarking Large Language Models for SIMD-Intrinsic Code Generation

Yibo He; Shuoran Zhao; Jiaming Huang; Yingjie Fu; Hao Yu; Cunjian Huang; Tao Xie

arXiv:2507.15224·cs.SE·July 22, 2025

SimdBench: Benchmarking Large Language Models for SIMD-Intrinsic Code Generation

Yibo He, Shuoran Zhao, Jiaming Huang, Yingjie Fu, Hao Yu, Cunjian Huang, Tao Xie

PDF

TL;DR

SimdBench is a new benchmark designed to evaluate how well large language models generate SIMD-intrinsic code, revealing current limitations and guiding future improvements in this specialized domain.

Contribution

This paper introduces SimdBench, the first dedicated benchmark for SIMD-intrinsic code generation by LLMs, and provides a systematic evaluation of 18 models across five SIMD extensions.

Findings

01

LLMs show decreased pass@k in SIMD-intrinsic code generation

02

Performance varies significantly across different SIMD extensions

03

Insights suggest directions for improving LLMs in vectorized code generation

Abstract

SIMD (Single Instruction Multiple Data) instructions and their compiler intrinsics are widely supported by modern processors to accelerate performance-critical tasks. SIMD intrinsic programming, a trade-off between coding productivity and high performance, is widely used in the development of mainstream performance-critical libraries and daily computing tasks. Large Language Models (LLMs), which have demonstrated strong and comprehensive capabilities in code generation, show promise in assisting programmers with the challenges of SIMD intrinsic programming. However, existing code-generation benchmarks focus on only scalar code, and it is unclear how LLMs perform in generating vectorized code using SIMD intrinsics. To fill this gap, we propose SimdBench, the first code benchmark specifically designed for SIMD-intrinsic code generation, comprising 136 carefully crafted tasks and targeting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.