MEC: Memory-efficient Convolution for Deep Neural Network

Minsik Cho; Daniel Brand

arXiv:1706.06873·cs.LG·June 22, 2017·49 cites

MEC: Memory-efficient Convolution for Deep Neural Network

Minsik Cho, Daniel Brand

PDF

Open Access

TL;DR

This paper introduces MEC, a memory-efficient convolution method that reduces memory overhead and accelerates deep neural network convolution operations by using compact lowering and parallel small matrix multiplications.

Contribution

MEC is a novel convolution algorithm that significantly reduces memory usage and improves speed compared to existing indirect methods.

Findings

01

MEC reduces memory consumption substantially.

02

MEC achieves good speedup on mobile and server platforms.

03

MEC outperforms other indirect convolution algorithms in efficiency.

Abstract

Convolution is a critical component in modern deep neural networks, thus several algorithms for convolution have been developed. Direct convolution is simple but suffers from poor performance. As an alternative, multiple indirect methods have been proposed including im2col-based convolution, FFT-based convolution, or Winograd-based algorithm. However, all these indirect methods have high memory-overhead, which creates performance degradation and offers a poor trade-off between performance and memory consumption. In this work, we propose a memory-efficient convolution or MEC with compact lowering, which reduces memory-overhead substantially and accelerates convolution process. MEC lowers the input matrix in a simple yet efficient/compact way (i.e., much less memory-overhead), and then executes multiple small matrix multiplications in parallel to get convolution completed. Additionally,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Sparse and Compressive Sensing Techniques

MethodsConvolution