PULP-NN: Accelerating Quantized Neural Networks on Parallel   Ultra-Low-Power RISC-V Processors

Angelo Garofalo; Manuele Rusci; Francesco Conti; Davide Rossi; Luca; Benini

arXiv:1908.11263·cs.NE·August 30, 2019

PULP-NN: Accelerating Quantized Neural Networks on Parallel Ultra-Low-Power RISC-V Processors

Angelo Garofalo, Manuele Rusci, Francesco Conti, Davide Rossi, Luca, Benini

PDF

1 Repo

TL;DR

PULP-NN is a specialized library that accelerates quantized neural network inference on ultra-low-power RISC-V clusters, achieving significant performance and energy efficiency improvements over existing solutions.

Contribution

It introduces optimized kernels for quantized neural networks on RISC-V processors, exploiting DSP extensions and parallelism for enhanced inference speed and energy efficiency.

Findings

01

Up to 15.5 MACs/cycle on INT-8 quantization.

02

Performance improvement of up to 63x over sequential baseline.

03

Significant reductions in inference cycles and energy consumption compared to ARM-based solutions.

Abstract

We present PULP-NN, an optimized computing library for a parallel ultra-low-power tightly coupled cluster of RISC-V processors. The key innovation in PULP-NN is a set of kernels for Quantized Neural Network (QNN) inference, targeting byte and sub-byte data types, down to INT-1, tuned for the recent trend toward aggressive quantization in deep neural network inference. The proposed library exploits both the digital signal processing (DSP) extensions available in the PULP RISC-V processors and the cluster's parallelism, achieving up to 15.5 MACs/cycle on INT-8 and improving performance by up to 63x with respect to a sequential implementation on a single RISC-V core implementing the baseline RV32IMC ISA. Using PULP-NN, a CIFAR-10 network on an octa-core cluster runs in 30x and 19.6x less clock cycles than the current state-of-the-art ARM CMSIS-NN library, running on STM32L4 and STM32H7…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pulp-platform/pulp-nn
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.