BinarEye: An Always-On Energy-Accuracy-Scalable Binary CNN Processor With All Memory On Chip in 28nm CMOS
Bert Moons, Daniel Bankman, Lita Yang, Boris Murmann, Marian Verhelst

TL;DR
BinarEye is a highly efficient, on-chip binary CNN processor that offers flexible energy-accuracy trade-offs, enabling real-time applications with minimal power consumption and no off-chip memory bandwidth.
Contribution
It introduces a fully on-chip memory binary CNN processor with flexible reconfiguration, achieving unprecedented energy efficiency for always-on applications.
Findings
Achieves 230 TOPS/W peak efficiency.
Supports flexible network configurations for energy-accuracy trade-offs.
Outperforms state-of-the-art by 3-70x in efficiency at similar accuracy.
Abstract
This paper introduces BinarEye: a digital processor for always-on Binary Convolutional Neural Networks. The chip maximizes data reuse through a Neuron Array exploiting local weight Flip-Flops. It stores full network models and feature maps and hence requires no off-chip bandwidth, which leads to a 230 1b-TOPS/W peak efficiency. Its 3 levels of flexibility - (a) weight reconfiguration, (b) a programmable network depth and (c) a programmable network width - allow trading energy for accuracy depending on the task's requirements. BinarEye's full system input-to-label energy consumption ranges from 14.4uJ/f for 86% CIFAR-10 and 98% owner recognition down to 0.92uJ/f for 94% face detection at up to 1700 frames per second. This is 3-12-70x more efficient than the state-of-the-art at on-par accuracy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
