BottleFit: Learning Compressed Representations in Deep Neural Networks   for Effective and Efficient Split Computing

Yoshitomo Matsubara; Davide Callegaro; Sameer Singh; Marco Levorato,; Francesco Restuccia

arXiv:2201.02693·cs.LG·September 7, 2022

BottleFit: Learning Compressed Representations in Deep Neural Networks for Effective and Efficient Split Computing

Yoshitomo Matsubara, Davide Callegaro, Sameer Singh, Marco Levorato,, Francesco Restuccia

PDF

2 Repos

TL;DR

BottleFit introduces a novel training framework for deep neural networks that enables high compression of representations with minimal accuracy loss, significantly reducing power consumption and latency in edge devices for image classification.

Contribution

The paper proposes BottleFit, a new approach combining architecture modifications and a training strategy to achieve high compression rates with minimal accuracy loss in split neural networks.

Findings

01

Achieves 77.1% data compression with only 0.6% accuracy loss on ImageNet.

02

Reduces power consumption by up to 49% and latency by up to 89% on NVIDIA Jetson Nano.

03

Outperforms state-of-the-art autoencoder approaches in power efficiency and model size reduction.

Abstract

Although mission-critical applications require the use of deep neural networks (DNNs), their continuous execution at mobile devices results in a significant increase in energy consumption. While edge offloading can decrease energy consumption, erratic patterns in channel quality, network and edge server load can lead to severe disruption of the system's key operations. An alternative approach, called split computing, generates compressed representations within the model (called "bottlenecks"), to reduce bandwidth usage and energy consumption. Prior work has proposed approaches that introduce additional layers, to the detriment of energy consumption and latency. For this reason, we propose a new framework called BottleFit, which, in addition to targeted DNN architecture modifications, includes a novel training strategy to achieve high accuracy even with strong compression rates. We apply…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.