EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network Accelerators

Lois Orosa; Skanda Koppula; Yaman Umuroglu; Konstantinos Kanellopoulos; Juan Gomez-Luna; Michaela Blott; Kees Vissers; Onur Mutlu

arXiv:2202.02310·cs.LG·February 7, 2022·1 cites

TL;DR

EcoFlow introduces specialized dataflows and algorithms that optimize dilated and transposed convolutions for low-power CNN accelerators, significantly improving training efficiency and energy use.

Contribution

EcoFlow presents novel dataflows and mapping algorithms that optimize dilated and transposed convolutions on existing spatial architectures with minimal modifications.

Findings

01 Reduces CNN training time by 7-85%

02 Improves GAN training performance by 29-42%

03 Enhances energy efficiency in low-power CNN accelerators

Abstract

Dilated and transposed convolutions are widely used in modern convolutional neural networks (CNNs). These kernels are used extensively during CNN training and inference of applications such as image segmentation and high-resolution image generation. Although these kernels have grown in popularity, they stress current compute systems due to their high memory intensity, exascale compute demands, and large energy consumption. We find that commonly-used low-power CNN inference accelerators based on spatial architectures are not optimized for both of these convolutional kernels. Dilated and transposed convolutions introduce significant zero padding when mapped to the underlying spatial architecture, significantly degrading performance and energy efficiency. Existing approaches that address this issue require significant design changes to the otherwise simple, efficient, and well-adopted…
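To make the zero-padding overhead described in the abstract concrete, the sketch below counts how many zeros appear when a transposed convolution is naively lowered to zero insertion in the input, or when a dilated kernel is expanded into a dense one. This is only an illustration of the problem, not EcoFlow's dataflow. The helper names `insert_zeros` and `zero_fraction` are hypothetical, the example assumes all original values are nonzero, and it ignores the extra border padding that a full transposed convolution would add.

```python
def insert_zeros(matrix, step):
    """Insert (step - 1) zeros between neighboring elements of a 2D list.

    With step = stride this models the naive input expansion of a
    transposed convolution; with step = dilation it models expanding a
    dilated kernel into a dense kernel.
    """
    rows, cols = len(matrix), len(matrix[0])
    out_rows = (rows - 1) * step + 1
    out_cols = (cols - 1) * step + 1
    out = [[0] * out_cols for _ in range(out_rows)]
    for i in range(rows):
        for j in range(cols):
            out[i * step][j * step] = matrix[i][j]
    return out


def zero_fraction(matrix):
    """Return the fraction of entries that are exactly zero."""
    total = sum(len(row) for row in matrix)
    zeros = sum(1 for row in matrix for value in row if value == 0)
    return zeros / total


# Illustrative sizes only (assumptions, not taken from the paper).
ones_input = [[1] * 4 for _ in range(4)]    # 4x4 input, all nonzero
ones_kernel = [[1] * 3 for _ in range(3)]   # 3x3 kernel, all nonzero

expanded_input = insert_zeros(ones_input, 2)    # stride-2 transposed conv
expanded_kernel = insert_zeros(ones_kernel, 2)  # dilation-2 kernel

print(len(expanded_input), zero_fraction(expanded_input))
print(len(expanded_kernel), zero_fraction(expanded_kernel))
```

In this toy case, well over half of the expanded entries are zeros, so a spatial architecture that maps the expanded operands directly would spend most of its multiply-accumulate slots on operations that contribute nothing to the output.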

Code & Models

Repositories

cmu-safari/sasiml (Official)

Taxonomy

Topics: Advanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning