SpikeBottleNet: Spike-Driven Feature Compression Architecture for Edge-Cloud Co-Inference

Maruf Hassan; Steven Davy

arXiv:2410.08673·cs.CV·November 8, 2024

TL;DR

SpikeBottleNet introduces a spike-driven feature compression method for edge-cloud DNN inference, significantly reducing energy consumption and transmission costs while maintaining high accuracy, by leveraging spiking neural networks and strategic feature encoding.

Contribution

The paper presents a novel spike-driven architecture with a tailored feature compression technique for efficient edge-cloud co-inference, enabling substantial energy and data transmission savings.

Findings

01

Achieves up to 256x feature compression in ResNet's final layer.

02

Reduces edge device energy consumption by up to 144x.

03

Keeps accuracy loss to 0.16% with compression.

Abstract

Edge-cloud co-inference enables efficient deep neural network (DNN) deployment by splitting the architecture between an edge device and a cloud server, which is crucial for resource-constrained edge devices. This approach requires balancing on-device computation against communication cost, often by transmitting compressed intermediate features. Conventional DNN architectures require continuous data processing and floating-point activations, which lead to considerable energy consumption and larger feature sizes, and thus to higher transmission costs. This challenge motivates exploring binary, event-driven activations using spiking neural networks (SNNs), known for their extreme energy efficiency. In this research, we propose SpikeBottleNet, a novel architecture for edge-cloud co-inference systems that integrates a spiking neuron model to significantly reduce energy consumption on edge devices.…
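To make the contrast between floating-point and binary, event-driven activations concrete, here is a minimal sketch. It is not the paper's implementation: the abstract does not name the neuron model, so a leaky integrate-and-fire (LIF) neuron with a hard reset is assumed here, and `lif_spikes`, `pack_bits`, `threshold`, and `decay` are hypothetical names and values chosen for illustration. The sketch shows only that a 0/1 spike train can be bit-packed, giving a 32x smaller payload than float32 values for a single time step. It does not reproduce the compression ratios reported above.

```python
def lif_spikes(currents, threshold=1.0, decay=0.5):
    """Run one assumed leaky integrate-and-fire neuron over a sequence of inputs.

    Returns a list of 0/1 spikes, one per input step.
    """
    membrane = 0.0
    spikes = []
    for current in currents:
        # Leak the previous potential, then integrate the new input.
        membrane = decay * membrane + current
        if membrane >= threshold:
            spikes.append(1)
            membrane = 0.0  # hard reset after firing (an assumption)
        else:
            spikes.append(0)
    return spikes


def pack_bits(spikes):
    """Pack a list of 0/1 spikes into bytes, eight spikes per byte."""
    packed = bytearray((len(spikes) + 7) // 8)
    for index, spike in enumerate(spikes):
        if spike:
            packed[index // 8] |= 1 << (index % 8)
    return bytes(packed)


# Illustrative inputs only, not data from the paper.
currents = [0.6, 0.6, 0.1, 0.9, 0.2, 0.0, 1.5, 0.3]
spikes = lif_spikes(currents)
payload = pack_bits(spikes)

float_bytes = 4 * len(currents)  # same values sent as float32 activations
print(spikes, len(payload), float_bytes // len(payload))
```

In a real SNN the spikes are emitted over several time steps, so the net saving depends on how many steps are transmitted and on any further encoding applied to the feature map.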

Taxonomy

Topics: Neural Networks and Applications · Advanced Memory and Neural Computing · Machine Learning and ELM

Methods: Average Pooling · Max Pooling · Global Average Pooling · Kaiming Initialization · Convolution