SpikeBottleNet: Spike-Driven Feature Compression Architecture for Edge-Cloud Co-Inference
Maruf Hassan, Steven Davy

TL;DR
SpikeBottleNet introduces a spike-driven feature compression method for edge-cloud DNN inference, significantly reducing energy consumption and transmission costs while maintaining high accuracy, by leveraging spiking neural networks and strategic feature encoding.
Contribution
The paper presents a novel spike-driven architecture with a tailored feature compression technique for efficient edge-cloud co-inference, enabling substantial energy and data transmission savings.
Findings
Achieves up to 256x feature compression in ResNet's final layer.
Reduces edge device energy consumption by up to 144x.
Maintains minimal accuracy loss of 0.16% with compression.
Abstract
Edge-cloud co-inference enables efficient deep neural network (DNN) deployment by splitting the architecture between an edge device and cloud server, crucial for resource-constraint edge devices. This approach requires balancing on-device computations and communication costs, often achieved through compressed intermediate feature transmission. Conventional DNN architectures require continuous data processing and floating point activations, leading to considerable energy consumption and increased feature sizes, thus raising transmission costs. This challenge motivates exploring binary, event-driven activations using spiking neural networks (SNNs), known for their extreme energy efficiency. In this research, we propose SpikeBottleNet, a novel architecture for edge-cloud co-inference systems that integrates a spiking neuron model to significantly reduce energy consumption on edge devices.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Machine Learning and ELM
MethodsAverage Pooling · Max Pooling · Global Average Pooling · Kaiming Initialization · Convolution
