TL;DR
BottleFit introduces a novel training framework for deep neural networks that enables high compression of representations with minimal accuracy loss, significantly reducing power consumption and latency in edge devices for image classification.
Contribution
The paper proposes BottleFit, a new approach combining architecture modifications and a training strategy to achieve high compression rates with minimal accuracy loss in split neural networks.
Findings
Achieves 77.1% data compression with only 0.6% accuracy loss on ImageNet.
Reduces power consumption by up to 49% and latency by up to 89% on NVIDIA Jetson Nano.
Outperforms state-of-the-art autoencoder approaches in power efficiency and model size reduction.
Abstract
Although mission-critical applications require the use of deep neural networks (DNNs), their continuous execution at mobile devices results in a significant increase in energy consumption. While edge offloading can decrease energy consumption, erratic patterns in channel quality, network and edge server load can lead to severe disruption of the system's key operations. An alternative approach, called split computing, generates compressed representations within the model (called "bottlenecks"), to reduce bandwidth usage and energy consumption. Prior work has proposed approaches that introduce additional layers, to the detriment of energy consumption and latency. For this reason, we propose a new framework called BottleFit, which, in addition to targeted DNN architecture modifications, includes a novel training strategy to achieve high accuracy even with strong compression rates. We apply…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
