Low-memory convolutional neural networks through incremental depth-first processing

Jonathan Binas; Yoshua Bengio

arXiv:1804.10727·cs.NE·May 22, 2019·1 cites

TL;DR

This paper presents an incremental depth-first processing method for CNN inference with hard bounds on memory usage (constant for 1D input, proportional to the square root of the input dimension for 2D input), targeting embedded systems with limited memory budgets.

Contribution

The paper introduces a depth-first updating scheme for CNN inference that places hard bounds on the memory footprint and applies to both 1D and 2D inputs.

Findings

01. Memory usage is constant for 1D inputs.

02. Memory scales with the square root of input size for 2D inputs.

03. The method enables CNN inference on memory-limited embedded devices.
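
One way to read the 2D finding: if pixels arrive in raster order, a k×k convolution has to hold roughly k-1 complete rows before it can emit an output, so its buffer grows with the row width rather than with the total pixel count. The sketch below counts buffer cells under that assumption (single channel, stride 1, no padding, square input). The helper `line_buffer_cells` is hypothetical and is not taken from the paper; it only illustrates the scaling.

```python
def line_buffer_cells(width, kernel_sizes):
    """Count buffered values for a stack of k x k convolutions fed in raster order.

    Assumption (not from the paper): each layer keeps (k - 1) full rows plus
    k values of the current row, single channel, stride 1, no padding.
    """
    total = 0
    row_width = width
    for k in kernel_sizes:
        total += (k - 1) * row_width + k
        row_width = row_width - k + 1  # a 'valid' convolution shrinks each row
    return total


# Quadrupling the pixel count (8x8 -> 16x16) roughly doubles the buffer.
small = line_buffer_cells(8, [3, 3])
large = line_buffer_cells(16, [3, 3])
```

For a square image with n = W × W pixels, the count is linear in W, which is the square root of n.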

Abstract

We introduce an incremental processing scheme for convolutional neural network (CNN) inference, targeted at embedded applications with limited memory budgets. Instead of processing layers one by one, individual input pixels are propagated through all parts of the network they can influence under the given structural constraints. This depth-first updating scheme comes with hard bounds on the memory footprint: the memory required is constant in the case of 1D input and proportional to the square root of the input dimension in the case of 2D input.
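
The 1D case can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes single-channel, stride-1, unpadded convolutions followed by a ReLU; the names `StreamingConv1D`, `depth_first`, and `layer_by_layer` are hypothetical. Each layer stores only its last k inputs, and every incoming sample is pushed as deep into the network as the filled windows allow.

```python
from collections import deque


class StreamingConv1D:
    """Single-channel, stride-1, unpadded 1D convolution fed one sample at a time."""

    def __init__(self, kernel):
        self.kernel = list(kernel)
        # Only the most recent len(kernel) inputs are ever stored.
        self.window = deque(maxlen=len(self.kernel))

    def push(self, x):
        """Store one sample; return a ReLU output once the window is full, else None."""
        self.window.append(x)
        if len(self.window) < len(self.kernel):
            return None
        return max(0.0, sum(w * v for w, v in zip(self.kernel, self.window)))


def depth_first(samples, kernels):
    """Propagate each sample through every layer it can already influence."""
    layers = [StreamingConv1D(k) for k in kernels]
    outputs = []
    for x in samples:
        value = x
        for layer in layers:
            value = layer.push(value)
            if value is None:  # this layer still needs more input
                break
        else:
            outputs.append(value)  # the sample reached the last layer
    buffer_cells = sum(len(layer.kernel) for layer in layers)
    return outputs, buffer_cells


def layer_by_layer(samples, kernels):
    """Reference: compute each full feature map before moving to the next layer."""
    acts = list(samples)
    for k in kernels:
        acts = [max(0.0, sum(w * acts[i + j] for j, w in enumerate(k)))
                for i in range(len(acts) - len(k) + 1)]
    return acts
```

In this sketch the buffer size is the sum of the kernel lengths, so it does not depend on how many samples are streamed through, whereas `layer_by_layer` holds a full feature map at every step.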

Taxonomy

Topics: Advanced Neural Network Applications · Video Surveillance and Tracking Methods · Image Enhancement Techniques