A 3 TOPS/W RISC-V Parallel Cluster for Inference of Fine-Grain Mixed-Precision Quantized Neural Networks
Alessandro Nadalini, Georg Rutishauser, Alessio Burrello, Nazareno, Bruschi, Angelo Garofalo, Luca Benini, Francesco Conti, Davide Rossi

TL;DR
This paper presents Flex-V, a RISC-V based processor cluster optimized for energy-efficient inference of mixed-precision quantized neural networks, achieving high performance and low area overhead.
Contribution
Introduction of Flex-V, a novel RISC-V processor with fused mixed-precision instructions and a comprehensive deployment framework for efficient QNN inference.
Findings
Achieves up to 3.26 TOPS/W in a 22nm process.
Provides up to 8.5x speed-up over existing solutions.
Demonstrates effective deployment of real-life QNNs.
Abstract
The emerging trend of deploying complex algorithms, such as Deep Neural Networks (DNNs), increasingly poses strict memory and energy efficiency requirements on Internet-of-Things (IoT) end-nodes. Mixed-precision quantization has been proposed as a technique to minimize a DNN's memory footprint and maximize its execution efficiency, with negligible end-to-end precision degradation. In this work, we present a novel hardware and software stack for energy-efficient inference of mixed-precision Quantized Neural Networks (QNNs). We introduce Flex-V, a processor based on the RISC-V Instruction Set Architecture (ISA) that features fused Mac&Load mixed-precision dot product instructions; to avoid the exponential growth of the encoding space due to mixed-precision variants, we encode formats into the Control-Status Registers (CSRs). Flex-V core is integrated into a tightly-coupled cluster of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Advanced Image and Video Retrieval Techniques
