Improved Projection Learning for Lower Dimensional Feature Maps

Ilan Price; Jared Tanner

arXiv:2210.15170·cs.LG·October 28, 2022·1 cites

Improved Projection Learning for Lower Dimensional Feature Maps

Ilan Price, Jared Tanner

PDF

Open Access

TL;DR

This paper proposes an improved projection learning method to compress CNN feature maps below a size limit, enabling more efficient on-chip inference by end-to-end finetuning and folding of learned projections.

Contribution

It introduces a learned projection approach for feature map compression and a ceiling compression framework for on-chip inference optimization.

Findings

01

Achieved significant reduction in feature map size with minimal accuracy loss.

02

Demonstrated the feasibility of fully on-chip CNN inference with compressed feature maps.

03

Provided a new framework for future energy-efficient neural network deployment.

Abstract

The requirement to repeatedly move large feature maps off- and on-chip during inference with convolutional neural networks (CNNs) imposes high costs in terms of both energy and time. In this work we explore an improved method for compressing all feature maps of pre-trained CNNs to below a specified limit. This is done by means of learned projections trained via end-to-end finetuning, which can then be folded and fused into the pre-trained network. We also introduce a new `ceiling compression' framework in which evaluate such techniques in view of the future goal of performing inference fully on-chip.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Stochastic Gradient Optimization Techniques