Fully Dynamic Inference with Deep Neural Networks

Wenhan Xia; Hongxu Yin; Xiaoliang Dai; Niraj K. Jha

arXiv:2007.15151·cs.CV·July 31, 2020·1 cites

Fully Dynamic Inference with Deep Neural Networks

Wenhan Xia, Hongxu Yin, Xiaoliang Dai, Niraj K. Jha

PDF

Open Access

TL;DR

This paper introduces LC-Net, a dynamic inference framework for deep neural networks that adaptively skips redundant layers and filters per input, significantly reducing computation while improving accuracy.

Contribution

It proposes a novel joint dynamic inference approach with L-Net and C-Net that adaptively selects network components per input, outperforming existing methods in efficiency and accuracy.

Findings

01

Up to 11.9× fewer FLOPs on CIFAR-10 with higher accuracy.

02

Achieves up to 1.4× fewer FLOPs on ImageNet with better Top-1 accuracy.

03

Outperforms state-of-the-art dynamic inference frameworks in efficiency and accuracy.

Abstract

Modern deep neural networks are powerful and widely applicable models that extract task-relevant information through multi-level abstraction. Their cross-domain success, however, is often achieved at the expense of computational cost, high memory bandwidth, and long inference latency, which prevents their deployment in resource-constrained and time-sensitive scenarios, such as edge-side inference and self-driving cars. While recently developed methods for creating efficient deep neural networks are making their real-world deployment more feasible by reducing model size, they do not fully exploit input properties on a per-instance basis to maximize computational efficiency and task accuracy. In particular, most existing methods typically use a one-size-fits-all approach that identically processes all inputs. Motivated by the fact that different images require different feature embeddings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification