Spatz: Clustering Compact RISC-V-Based Vector Units to Maximize Computing Efficiency
Matteo Perotti, Samuel Riedel, Matheus Cavalcante, Luca Benini

TL;DR
Spatz introduces a compact, energy-efficient RISC-V vector processor with a small register file, enabling high performance and efficiency in clustered architectures for modern computational workloads.
Contribution
The paper presents Spatz, a novel compact vector processor architecture based on RISC-V, demonstrating high efficiency with a minimal register file size and scalable clustering for improved performance.
Findings
Achieves 15.7 DP-GFLOPS at 1 GHz with 95.7 DP-GFLOPS/W efficiency.
Reaches 95% FPU utilization on matrix multiplication workloads.
Outperforms scalar core clusters in energy efficiency by 30% at equal area.
Abstract
The ever-increasing computational and storage requirements of modern applications and the slowdown of technology scaling pose major challenges to designing and implementing efficient computer architectures. To mitigate the bottlenecks of typical processor-based architectures on both the instruction and data sides of the memory, we present Spatz, a compact 64-bit floating-point-capable vector processor based on RISC-V's Vector Extension Zve64d. Using Spatz as the main Processing Element (PE), we design an open-source dual-core vector processor architecture based on a modular and scalable cluster sharing a Scratchpad Memory (SCM). Unlike typical vector processors, whose Vector Register Files (VRFs) are hundreds of KiB large, we prove that Spatz can achieve peak energy efficiency with a latch-based VRF of only 2 KiB. An implementation of the Spatz-based cluster in GlobalFoundries' 12LPP…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Interconnection Networks and Systems
