Secure and Storage-Efficient Deep Learning Models for Edge AI Using Automatic Weight Generation

Habibur Rahaman; Atri Chatterjee; Swarup Bhunia

arXiv:2507.06380 · cs.LG · July 10, 2025

Open Access

TL;DR

This paper presents WINGs, a framework that dynamically generates and compresses neural network weights using PCA and SVR, significantly reducing memory usage while maintaining accuracy for edge AI applications.

Contribution

WINGs introduces a novel weight generation and compression method using PCA and SVR, enhancing security and efficiency for deep learning models on edge devices.

Findings

01 Achieves 53x compression for FC layers with minimal accuracy loss

02 Reduces memory by 28x on AlexNet with MNIST dataset

03 Decreases energy consumption and increases throughput for DNN inference

Abstract

Complex neural networks require substantial memory to store a large number of synaptic weights. This work introduces WINGs (Automatic Weight Generator for Secure and Storage-Efficient Deep Learning Models), a novel framework that dynamically generates layer weights in fully connected (FC) neural networks and compresses the weights in convolutional neural networks (CNNs) during inference, significantly reducing memory requirements without sacrificing accuracy. The WINGs framework uses principal component analysis (PCA) for dimensionality reduction and lightweight support vector regression (SVR) models to predict layer weights in FC networks, which removes the need to store full weight matrices and yields substantial memory savings. It also preferentially compresses the weights in low-sensitivity layers of CNNs, using PCA and SVR guided by sensitivity analysis. The sensitivity-aware design…
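The PCA stage mentioned in the abstract can be illustrated with a minimal sketch. This is not the WINGs pipeline itself: the SVR weight-prediction stage and the sensitivity analysis are not shown, the weight matrix is synthetic, and the function names (`pca_compress`, `pca_reconstruct`, `compression_ratio`) and the storage accounting are assumptions made for illustration only.

```python
import numpy as np

def pca_compress(weights, k):
    """Project the rows of `weights` onto their top-k principal components."""
    mean = weights.mean(axis=0)
    centered = weights - mean
    # Rows of vt are the principal directions of the centered matrix.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:k]                # shape (k, n)
    scores = centered @ components.T   # shape (m, k)
    return scores, components, mean

def pca_reconstruct(scores, components, mean):
    """Rebuild an approximation of the original weight matrix."""
    return scores @ components + mean

def compression_ratio(m, n, k):
    """Original entry count divided by stored entries (scores, components, mean)."""
    return (m * n) / (k * (m + n) + n)

rng = np.random.default_rng(0)
# Hypothetical stand-in for an FC layer: low-rank structure plus small noise.
m, n, rank = 256, 128, 4
weights = rng.normal(size=(m, rank)) @ rng.normal(size=(rank, n))
weights += 0.01 * rng.normal(size=(m, n))

scores, components, mean = pca_compress(weights, k=8)
approx = pca_reconstruct(scores, components, mean)
rel_error = np.linalg.norm(weights - approx) / np.linalg.norm(weights)
ratio = compression_ratio(m, n, k=8)
print(f"relative error: {rel_error:.4f}, compression ratio: {ratio:.2f}x")
```

The ratio here only counts stored matrix entries for this toy layer. The compression figures reported for WINGs come from the full method and are not reproduced by this sketch.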

Taxonomy

Topics: Advanced Neural Network Applications · Adversarial Robustness in Machine Learning · Advanced Memory and Neural Computing