Efficient Implementation of RISC-V Vector Permutation Instructions
Vasileios Titopoulos, George Alexakis, Chrysostomos Nicopoulos, Giorgos Dimitrakopoulos

TL;DR
This paper presents a unified microarchitecture for efficiently executing all RISC-V vector permutation instructions, reducing hardware costs and maintaining fixed latency, validated through implementation in a 7 nm open-source processor.
Contribution
A novel unified microarchitecture design that efficiently executes all RVV permutation instructions with minimal area overhead in a RISC-V vector processor.
Findings
Only 1.5% area overhead in the vector processor.
Area overhead decreases as minimum element width increases.
Supports single-cycle execution for vectors up to 256 bits.
Abstract
RISC-V CPUs leverage the RVV (RISC-V Vector) extension to accelerate data-parallel workloads. In addition to arithmetic operations, RVV includes powerful permutation instructions that enable flexible element rearrangement within vector registers --critical for optimizing performance in tasks such as matrix operations and cryptographic computations. However, the diverse control mechanisms of these instructions complicate their execution within a unified datapath while maintaining the fixed-latency requirement of cryptographic accelerators. To address this, we propose a unified microarchitecture capable of executing all RVV permutation instructions efficiently, regardless of their control information structure. This approach minimizes area and hardware costs while ensuring single-cycle execution for short vector machines (up to 256 bits) and enabling efficient pipelining for longer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCoding theory and cryptography · graph theory and CDMA systems · Interconnection Networks and Systems
