Larq Compute Engine: Design, Benchmark, and Deploy State-of-the-Art   Binarized Neural Networks

Tom Bannink; Arash Bakhtiari; Adam Hillier; Lukas Geiger; Tim de; Bruin; Leon Overweel; Jelmer Neeven; Koen Helwegen

arXiv:2011.09398·cs.LG·February 23, 2021

Larq Compute Engine: Design, Benchmark, and Deploy State-of-the-Art Binarized Neural Networks

Tom Bannink, Arash Bakhtiari, Adam Hillier, Lukas Geiger, Tim de, Bruin, Leon Overweel, Jelmer Neeven, Koen Helwegen

PDF

1 Repo

TL;DR

Larq Compute Engine (LCE) is a highly optimized inference engine for Binarized Neural Networks that accelerates binary operations, supports hybrid models, and enables the design of faster, more accurate BNN architectures like QuickNet for computer vision tasks.

Contribution

This work introduces LCE, the fastest BNN inference engine, and presents QuickNet, a new state-of-the-art BNN architecture optimized for speed and accuracy on ImageNet.

Findings

01

LCE accelerates binary convolutions by 8.5-18.5x on Pixel 1 phones.

02

QuickNet outperforms existing binary networks in latency and accuracy.

03

Empirical analysis links model latency to MAC count and the use of full-precision shortcuts.

Abstract

We introduce Larq Compute Engine, the world's fastest Binarized Neural Network (BNN) inference engine, and use this framework to investigate several important questions about the efficiency of BNNs and to design a new state-of-the-art BNN architecture. LCE provides highly optimized implementations of binary operations and accelerates binary convolutions by 8.5 - 18.5x compared to their full-precision counterparts on Pixel 1 phones. LCE's integration with Larq and a sophisticated MLIR-based converter allow users to move smoothly from training to deployment. By extending TensorFlow and TensorFlow Lite, LCE supports models which combine binary and full-precision layers, and can be easily integrated into existing applications. Using LCE, we analyze the performance of existing BNN computer vision architectures and develop QuickNet, a simple, easy-to-reproduce BNN that outperforms existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gholmes829/BinCNN
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.