Compacting Deep Neural Networks for Internet of Things: Methods and Applications

Ke Zhang; Hanbo Ying; Hong-Ning Dai; Lin Li; Yuangyuang Peng; Keyi Guo; Hongfang Yu

arXiv:2103.11083·cs.LG·March 23, 2021

TL;DR

This paper provides a comprehensive overview of methods to reduce the size and computational demands of deep neural networks for IoT devices, covering techniques like compression, knowledge distillation, and structural modifications.

Contribution

The paper categorizes and compares DNN compacting techniques specifically for IoT applications, filling a gap in the survey literature.

Findings

01. Network model compression effectively reduces DNN size.

02. Knowledge Distillation transfers knowledge to smaller models.

03. Structural modifications improve efficiency without significant accuracy loss.
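
To make the Knowledge Distillation finding concrete, here is a minimal sketch of one common distillation objective: the student is trained to match the teacher's temperature-softened output distribution while still fitting the true label. This summary does not say which KD variants the survey covers, so the formulation below (a weighted sum of a KL term and a cross-entropy term) is an assumption chosen for illustration, and the function names, `temperature`, and `alpha` are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities, softened by the given temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft (teacher-matching) and a hard (label) term.

    Assumed formulation, not taken from the paper:
      soft = T^2 * KL(teacher_T || student_T)
      hard = cross-entropy of the student against the true label at T = 1
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(pt * math.log(pt / ps)
             for pt, ps in zip(p_teacher, p_student) if pt > 0)
    hard = -math.log(softmax(student_logits)[label])
    return alpha * temperature ** 2 * kl + (1 - alpha) * hard


if __name__ == "__main__":
    teacher = [6.0, 2.0, 1.0]   # illustrative logits from a large model
    student = [3.0, 2.5, 0.5]   # illustrative logits from a compact model
    print(distillation_loss(student, teacher, label=0))
```

A higher `temperature` flattens both distributions so that the teacher's relative preferences among the non-target classes contribute more to the soft term; `alpha` sets how much the student follows the teacher versus the ground-truth label.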

Abstract

Deep Neural Networks (DNNs) have shown great success in completing complex tasks. However, DNNs inevitably bring high computational cost and storage consumption due to the complexity of hierarchical structures, thereby hindering their wide deployment in Internet-of-Things (IoT) devices, which have limited computational capability and storage capacity. Therefore, it is a necessity to investigate the technologies to compact DNNs. Despite tremendous advances in compacting DNNs, few surveys summarize compacting-DNNs technologies, especially for IoT applications. Hence, this paper presents a comprehensive study on compacting-DNNs technologies. We categorize compacting-DNNs technologies into three major types: 1) network model compression, 2) Knowledge Distillation (KD), 3) modification of network structures. We also elaborate on the diversity of these approaches and make side-by-side…

Taxonomy

Methods: Knowledge Distillation