DTNN: Energy-efficient Inference with Dendrite Tree Inspired Neural   Networks for Edge Vision Applications

Tao Luo; Wai Teng Tang; Matthew Kay Fei Lee; Chuping Qu; Weng-Fai; Wong; Rick Goh

arXiv:2105.11848·cs.CV·May 26, 2021

DTNN: Energy-efficient Inference with Dendrite Tree Inspired Neural Networks for Edge Vision Applications

Tao Luo, Wai Teng Tang, Matthew Kay Fei Lee, Chuping Qu, Weng-Fai, Wong, Rick Goh

PDF

Open Access

TL;DR

This paper introduces DTNN, a dendrite tree inspired neural network architecture that significantly reduces energy consumption during inference on edge devices by eliminating costly operations through activation quantization and table lookup, maintaining accuracy.

Contribution

The paper proposes a novel DTNN architecture that enables energy-efficient inference by removing weight access and arithmetic computations, validated across multiple models and datasets.

Findings

01

Achieved up to 64.9X energy savings on VGG-11 with ImageNet.

02

Maintained negligible accuracy loss across various models and datasets.

03

Demonstrated superior energy efficiency and latency on FPGA implementations.

Abstract

Deep neural networks (DNN) have achieved remarkable success in computer vision (CV). However, training and inference of DNN models are both memory and computation intensive, incurring significant overhead in terms of energy consumption and silicon area. In particular, inference is much more cost-sensitive than training because training can be done offline with powerful platforms, while inference may have to be done on battery powered devices with constrained form factors, especially for mobile or edge vision applications. In order to accelerate DNN inference, model quantization was proposed. However previous works only focus on the quantization rate without considering the efficiency of operations. In this paper, we propose Dendrite-Tree based Neural Network (DTNN) for energy-efficient inference with table lookup operations enabled by activation quantization. In DTNN both costly weight…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Advanced Memory and Neural Computing

MethodsAverage Pooling · 1x1 Convolution · Batch Normalization · Residual Block · Kaiming Initialization · Residual Connection · Bottleneck Residual Block · Global Average Pooling · Max Pooling · Dense Connections