Secure and Storage-Efficient Deep Learning Models for Edge AI Using Automatic Weight Generation
Habibur Rahaman, Atri Chatterjee, Swarup Bhunia

TL;DR
This paper presents WINGs, a framework that dynamically generates and compresses neural network weights using PCA and SVR, significantly reducing memory usage while maintaining accuracy for edge AI applications.
Contribution
WINGs introduces a novel weight generation and compression method using PCA and SVR, enhancing security and efficiency for deep learning models on edge devices.
Findings
Achieves 53x compression for FC layers with minimal accuracy loss
Reduces memory by 28x on AlexNet with MNIST dataset
Decreases energy consumption and increases throughput for DNN inference
Abstract
Complex neural networks require substantial memory to store a large number of synaptic weights. This work introduces WINGs (Automatic Weight Generator for Secure and Storage-Efficient Deep Learning Models), a novel framework that dynamically generates layer weights in a fully connected neural network (FC) and compresses the weights in convolutional neural networks (CNNs) during inference, significantly reducing memory requirements without sacrificing accuracy. WINGs framework uses principal component analysis (PCA) for dimensionality reduction and lightweight support vector regression (SVR) models to predict layer weights in the FC networks, removing the need for storing full-weight matrices and achieving substantial memory savings. It also preferentially compresses the weights in low-sensitivity layers of CNNs using PCA and SVR with sensitivity analysis. The sensitivity-aware design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Advanced Memory and Neural Computing
