Vector-Processing for Mobile Devices: Benchmark and Analysis
Alireza Khadem, Daichi Fujiki, Nishil Talati, Scott Mahlke, Reetuparna, Das

TL;DR
This paper introduces Swan, a comprehensive benchmark suite for evaluating vector processing in mobile devices, analyzing its performance, power, and energy impacts across diverse mobile workloads.
Contribution
The paper presents Swan, the first extensive benchmark suite for vector processing in mobile applications, along with a detailed analysis of its performance and energy characteristics.
Findings
Vectorized workloads increase cache pressure due to more memory requests.
Vector processing benefits are higher for low-precision, cache-friendly workloads.
Limited parallelism and strided memory access hinder scaling of vector processing.
Abstract
Vector processing has become commonplace in today's CPU microarchitectures. Vector instructions improve performance and energy which is crucial for resource-constraint mobile devices. The research community currently lacks a comprehensive benchmark suite to study the benefits of vector processing for mobile devices. This paper presents Swan-an extensive vector processing benchmark suite for mobile applications. Swan consists of a diverse set of data-parallel workloads from four commonly used mobile applications: operating system, web browser, audio/video messaging application, and PDF rendering engine. Using Swan benchmark suite, we conduct a detailed analysis of the performance, power, and energy consumption of vectorized workloads, and show that: (a) Vectorized kernels increase the pressure on cache hierarchy due to the higher rate of memory requests. (b) Vector processing is more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Parallel Computing and Optimization Techniques · Advanced Data Storage Technologies
