XNOR Neural Engine: a Hardware Accelerator IP for 21.6 fJ/op Binary   Neural Network Inference

Francesco Conti; Pasquale Davide Schiavone; Luca Benini

arXiv:1807.03010·cs.NE·July 23, 2018

XNOR Neural Engine: a Hardware Accelerator IP for 21.6 fJ/op Binary Neural Network Inference

Francesco Conti, Pasquale Davide Schiavone, Luca Benini

PDF

1 Repo

TL;DR

The paper presents the XNOR Neural Engine, a digital hardware accelerator for Binary Neural Networks that achieves extremely low energy per operation and can run complex models like ResNet-34 efficiently.

Contribution

Introduction of the XNOR Neural Engine (XNE), a configurable hardware accelerator IP integrated within a microcontroller for efficient BNN inference.

Findings

01

Energy cost per binary operation is 21.6fJ at 0.4V.

02

Can execute ResNet-34 in less than 2.2mJ per frame.

03

Supports autonomous and cooperative layer computation.

Abstract

Binary Neural Networks (BNNs) are promising to deliver accuracy comparable to conventional deep neural networks at a fraction of the cost in terms of memory and energy. In this paper, we introduce the XNOR Neural Engine (XNE), a fully digital configurable hardware accelerator IP for BNNs, integrated within a microcontroller unit (MCU) equipped with an autonomous I/O subsystem and hybrid SRAM / standard cell memory. The XNE is able to fully compute convolutional and dense layers in autonomy or in cooperation with the core in the MCU to realize more complex behaviors. We show post-synthesis results in 65nm and 22nm technology for the XNE IP and post-layout results in 22nm for the full MCU indicating that this system can drop the energy cost per binary operation to 21.6fJ per operation at 0.4V, and at the same time is flexible and performant enough to execute state-of-the-art BNN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pulp-platform/hwpe-tb
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.