Fully Dynamic Inference with Deep Neural Networks
Wenhan Xia, Hongxu Yin, Xiaoliang Dai, Niraj K. Jha

TL;DR
This paper introduces LC-Net, a dynamic inference framework for deep neural networks that adaptively skips redundant layers and filters per input, significantly reducing computation while improving accuracy.
Contribution
It proposes a novel joint dynamic inference approach with L-Net and C-Net that adaptively selects network components per input, outperforming existing methods in efficiency and accuracy.
Findings
Up to 11.9× fewer FLOPs on CIFAR-10 with higher accuracy.
Achieves up to 1.4× fewer FLOPs on ImageNet with better Top-1 accuracy.
Outperforms state-of-the-art dynamic inference frameworks in efficiency and accuracy.
Abstract
Modern deep neural networks are powerful and widely applicable models that extract task-relevant information through multi-level abstraction. Their cross-domain success, however, is often achieved at the expense of computational cost, high memory bandwidth, and long inference latency, which prevents their deployment in resource-constrained and time-sensitive scenarios, such as edge-side inference and self-driving cars. While recently developed methods for creating efficient deep neural networks are making their real-world deployment more feasible by reducing model size, they do not fully exploit input properties on a per-instance basis to maximize computational efficiency and task accuracy. In particular, most existing methods typically use a one-size-fits-all approach that identically processes all inputs. Motivated by the fact that different images require different feature embeddings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
