Ara2: Exploring Single- and Multi-Core Vector Processing with an Efficient RVV 1.0 Compliant Open-Source Processor
Matteo Perotti, Matheus Cavalcante, Renzo Andri, Lukas Cavigelli, Luca, Benini

TL;DR
Ara2 is an open-source RISC-V vector processor that demonstrates high performance and energy efficiency on data-parallel workloads, with insights into microarchitecture bottlenecks and multi-core benefits.
Contribution
First fully open-source RISC-V V 1.0 compliant vector processor, with detailed performance, energy, and microarchitectural analysis, including multi-core trade-offs.
Findings
Achieves 95% functional-unit utilization on intensive kernels.
Reaches 37.8 DP-GFLOPS/W energy efficiency at 0.8V.
Multi-core clusters significantly improve performance and energy efficiency.
Abstract
Vector processing is highly effective in boosting processor performance and efficiency for data-parallel workloads. In this paper, we present Ara2, the first fully open-source vector processor to support the RISC-V V 1.0 frozen ISA. We evaluate Ara2's performance on a diverse set of data-parallel kernels for various problem sizes and vector-unit configurations, achieving an average functional-unit utilization of 95% on the most computationally intensive kernels. We pinpoint performance boosters and bottlenecks, including the scalar core, memories, and vector architecture, providing insights into the main vector architecture's performance drivers. Leveraging the openness of the design, we implement Ara2 in a 22nm technology, characterize its PPA metrics on various configurations (2-16 lanes), and analyze its microarchitecture and implementation bottlenecks. Ara2 achieves a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Interconnection Networks and Systems · Advanced Data Storage Technologies
