DeepIoT: Compressing Deep Neural Network Structures for Sensing Systems with a Compressor-Critic Framework
Shuochao Yao, Yiran Zhao, Aston Zhang, Lu Su, Tarek Abdelzaher

TL;DR
DeepIoT introduces a unified compression framework for various deep neural networks used in sensing applications, significantly reducing model size and energy consumption on embedded devices without sacrificing accuracy.
Contribution
It presents a novel, global-view compression method applicable to multiple neural network types, outperforming existing solutions in efficiency and resource savings.
Findings
Reduces neural network size by up to 98.9%
Shortens execution time by up to 94.5%
Decreases energy consumption by up to 95.7%
Abstract
Recent advances in deep learning motivate the use of deep neutral networks in sensing applications, but their excessive resource needs on constrained embedded devices remain an important impediment. A recently explored solution space lies in compressing (approximating or simplifying) deep neural networks in some manner before use on the device. We propose a new compression solution, called DeepIoT, that makes two key contributions in that space. First, unlike current solutions geared for compressing specific types of neural networks, DeepIoT presents a unified approach that compresses all commonly used deep learning structures for sensing applications, including fully-connected, convolutional, and recurrent neural networks, as well as their combinations. Second, unlike solutions that either sparsify weight matrices or assume linear structure within weight matrices, DeepIoT compresses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Anomaly Detection Techniques and Applications · Target Tracking and Data Fusion in Sensor Networks
